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EXECUTIVE  SUMMARY 


OVERVIEW 

The  goal  of  this  effort  is  to  develop  a  firm  theoretical  foundation  for  the  acoustic  parabolic  equation 
(PE)  used  in  the  presence  of  penetrable  rough  interfaces.  As  the  interface  roughness  extends  through  the 
wavelength  scale,  it  induces  Bragg  scattering  (i.e.,  it  behaves  like  a  diffraction  grating).  This  is  a 
nontrivial,  phase-sensitive  problem  that  involves  theoretical  and  computational  challenges  that  go  beyond 
those  found  in  problems  to  which  the  PE  is  most  typically  applied.  A  satisfactory  formalism  that 
addresses  Bragg  scattering  should  fully  integrate  the  parabolic  equation  with  field  and  rough  surface 
scattering  theories.  Such  a  formalism  is  available  for  impenetrable  rough  surfaces  (i.e.,  infinite  density 
jumps),  but  the  traditional  PE  formalism  for  a  finite  density  jump  is  based  on  ad  hoc  arguments  rather 
than  on  a  formal  development  grounded  in  these  theories.  In  this  report,  the  Foldy-Wouthuysen 
transformation  is  used  to  design  a  parabolic  equation  formalism  that  addresses  this  challenge.  The 
associated  parabolic  equations  predict  phenomena  not  previously  noted,  and  two  of  these  are  examined  in 
detail: 

1 .  the  classical  equivalent  of  vacuum  polarization 

2.  the  buffering  of  jumps  in  the  downrange  flux  at  a  density  jump. 

The  former  involves  interesting  physics,  but  it  is  a  modest  effect  and  there  are  no  immediate 
applications  related  to  underwater  sonar.  The  study  of  the  classical  “vacuum  polarization”  is  primarily 
significant  for  the  insights  it  provides  about  the  nature  of  the  parabolic  equations  generated  by  our  formal 
approach.  The  second  topic  of  this  study,  on  the  other  hand,  leads  to  significant  practical  applications. 
Along  an  interface  where  the  density  jumps,  the  full- wave  problem  predicts  a  jump  in  the  downrange 
flux,  but  the  parabolic  equation  generated  by  the  Foldy-Wouthuysen  transformation  buffers  this 
discontinuity  by  absorbing  it  into  the  higher-order  boundary  conditions.  The  new  formalism  is  free  of  the 
ad  hoc  fixes  that  have  characterized  the  parabolic  equation  methods  currently  used  in  the  vicinity  of  a 
jump  in  the  density.  This  is  significant  because  the  techniques  used  to  adapt  the  PE  to  rough  impenetrable 
interfaces  ultimately  rely  on  conformal  mappings,  a  local  method  of  images,  perturbation  theory,  or  some 
similar  distortion  of  the  range -independent  problem,  while  the  ad  hoc  fixes  currently  employed  at  a 
density  jump  prevent  the  use  of  such  distortions.  The  new  PE,  on  the  other  hand,  allows  interfaces  where 
the  density  jumps  (such  as  the  ocean  bottom)  to  be  distorted  into  rough  ones,  and  so  it  is  ideally  suited 
for  the  modeling  of  (forward)  scattering  from  multiscale  rough  surfaces.  The  formalism  can  also  be  used 
to  generate  stochastic  equations  in  circumstances  where  it  has  been  impossible  until  now  to  do  so. 

TECHNICAL  APPROACH 

The  parabolic  equation  (PE)  is  widely  used  to  model  the  propagation  of  classical  fields  in  ducted 
environments.  This  equation  is  essentially  the  Schrodinger  equation  for  a  classical  field,  where  the  range 
plays  the  role  of  the  time.  One  of  the  principal  challenges  in  constructing  this  type  of  1-way  stepping 
algorithm  is  the  incorporation  of  multiscale  stochastic  range-dependent  fluctuations  of  the  environment 
near  interfaces  where  the  environmental  parameters  or  their  gradients  are  discontinuous.  Atomic  physics 
suggests  a  technique  for  doing  so.  As  the  hydrogen  atom  is  advected  by  vacuum  fluctuations,  the  electron 
field  encounters  a  turbulent  environment  similar  to  that  encountered,  for  example,  by  an  underwater 
acoustic  field.  In  Welton’s  semi-classical  model,  the  atomic  Lamb  shift  is  generated  by  a  contact  potential 
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that  is  a  direct  consequence  of  time-averaging  the  interaction  of  the  electron  field  with  these  fluctuations. 
This  contact  potential  at  the  nucleus  of  the  atom  is  connected  to  a  singularity  in  the  gradient  of  the 
potential  that  is  step-like  in  the  sense  that  its  divergence  is  a  8  -function.  This  contact  potential  shifts  the 
energy  levels  (i.e.,  eigenvalues)  for  the  stochastic  coherent  (i.e.,  average)  field.  A  Schrodinger  equation 
derived  using  the  Foldy-Wouthuysen  transformation  introduces  an  aspect  of  this  phenomenon  that  is  often 
missed  by  semi-classical  treatments.  Although  the  fluctuations  in  3-dimensional  physical  space  are 
imposed  by  hand,  the  resultant  time-dependent  potential  also  induces  virtual  fluctuations  in  the  time 
domain,  and  this  effect  appears  in  the  form  of  new  terms  that  appear  in  the  Schrodinger  equation  derived 
using  the  Foldy-Wouthuysen  transformation.  Even  before  time  averaging,  the  fluctuating  singularity  at 
the  nucleus  has  been  buffered  by  a  cloud  of  virtual  particle  pairs.  Using  the  terminology  of  field  theory, 
“vacuum  polarization”  has  “dressed”  the  bare  singularity.  The  “vacuum  polarization”  effect  also  induces  a 
8  -function,  and  it  provides  a  correction  to  the  basic  Lamb  shift  effect. 

This  provides  a  context  for  using  the  parabolic  equation  to  explore  acoustic  effects  directly  analogous 
to  the  Lamb  shift.  Consider  a  sound  speed  profile  that  contains  singularities  such  as  jumps  in  the  sound 
speed  and/or  its  gradient.  Take  a  stochastic  average,  and  consider  the  coherent  (i.e.,  average)  field.  The 
averaging  process  leaves  us  with  contact  potentials  along  the  range-independent  average  of  the  interfaces, 
where  the  sound  speed  profile  contains  singularities.  By  taking  transverse  integrals,  we  see  that  the 
contact  potentials  modify  the  boundary  conditions  on  the  wave  function  at  the  interface.  We  can 
decompose  the  coherent  field  solutions  into  modes  (i.e.,  eigenfunctions).  In  the  acoustic  Lamb  shift,  the 
downrange  components  of  the  wavevectors  characterizing  the  modes  are  the  eigenvalues,  and  so  these 
take  the  place  of  the  energy  in  the  atomic  problem,  and  so  it  is  shifted  by  the  changes  in  the  boundary 
conditions.  In  a  realistic  shallow-water  scenario,  this  in  turn  shifts  the  features  of  the  transmission  loss 
curve.  While  this  can  be  “significant”  in  a  formal  sense  (10  dB  or  more  at  a  fixed  location  if  we  also  go 
on  to  include  for  this  particular  effect  the  contribution  from  a  density  jump  at  the  ocean  bottom),  given  the 
nature  of  field  experiments  and  current  sonar  implementations,  the  data  are  rarely  taken  in  a  way  that 
would  readily  lend  itself  to  an  examination  of  this  phenomenon.  Therefore  currently,  the  primary 
significance  of  the  classical  Lamb  shift  lies  in  what  in  tells  us  about  the  nature  of  the  parabolic  equations 
generated  by  our  formal  approach.  The  dominant  contribution  to  the  Lamb  shift  results  from  the  smearing 
of  a  stochastic  rough  surface,  but  the  Foldy-Wouthuysen  transformation  applied  to  the  acoustic  wave 
equation  also  generates  a  classical  manifestation  of  the  “vacuum  polarization”  correction  to  the  Lamb 
shift.  As  with  the  quantum  problem,  this  effect  provides  only  a  modest  contribution  to  the  classical  Lamb 
shift,  but  it  is  significant  because  it  is  intrinsic  to  the  parabolic  approximation  and  so  the  smearing  already 
occurs  in  the  deterministic  problem.  While  this  is  the  first  example  in  acoustics  of  the  Foldy-Wouthuysen 
transformation  “dressing”  a  bare  singularity,  the  formalism  similarly  addresses  other  singularities  that  are 
apparently  unique  to  classical  fields.  Most  importantly,  it  “dresses”  the  jump  in  downrange  flux 
associated  with  a  density  discontinuity,  thus  eliminating  the  need  for  the  ad  hoc  fixes  usually  employed 
whenever  the  parabolic  equation  is  used  to  model  an  acoustic  field  near  a  2-fluid  interface  (such  as  a 
penetrable  rough  ocean  bottom).  The  formalism  reproduces  as  special  cases  the  most  successful  methods 
currently  in  use,  and  puts  them  on  a  firm  theoretical  foundation.  It  also  suggests  higher-order  corrections 
to  these  results.  Most  significantly,  by  buffering  the  singularity,  the  new  formalism  imposes  well-behaved 
boundary  conditions  along  the  interface  that  can  be  incorporated  naturally  into  a  stepping  algorithm  - 
even  along  a  sloping  interface.  In  this  way,  the  formalism  is  uniquely  applicable  to  multiscale 
deterministic  rough  interfaces,  he  second-order  deterministic  theory  is  specifically  examined  in  this 
report.  The  formalism  moreover  allows  consideration  of  stochastic  rough  ocean  bottoms  in  realistic 
scenarios,  where  it  has  not  been  previously  possible  to  do  so. 

Similar  issues  appear  with  electromagnetic  and  elastodynamic  fields,  and  this  technique  can  be  applied 
to  these  problems  as  well.  Here,  the  creative  insights  needed  to  adapt  the  formalism  to  these  cases  are 
addressed. 
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USING  THE  FOLDY-WOUTHUYSEN  TRANSFORMATION 
TO  DERIVE  ACOUSTIC  PARABOLIC  EQUATIONS  THAT 
PROPERLY  “DRESS”  DISCONTINUITIES 


1.  INTRODUCTION  AND  ROADMAP 

A  new  systematic  technique  for  deriving  the  parabolic  equation  (PE)  for  classical  fields  has  been 
developed  over  the  last  several  years.  This  effort  began  with  a  novel  derivation  of  the  acoustic  PE  that 
introduced  a  new  physical  effect  into  the  context  of  long-range  propagation  in  the  underwater  sound 
channel  [1],  Over  the  ensuing  years,  this  line  of  research  has  spawned  a  wide  range  of  new  developments. 
Preliminary  reports  of  aspects  of  this  long-term  effort  have  been  presented  orally  [2-6],  while  in  two 
recent  articles  by  the  author  [7,  8]  the  result  of  this  effort  that  is  of  most  immediate  practical  concern  was 
discussed:  an  acoustic  PE  that  inherently  buffers  density  discontinuities  in  a  manner  dictated  by  a  precise 
physical  theory.  The  practical  relevance  of  this  result  lies  in  the  fact  that  it  addresses  a  key  challenge 
confronting  the  PE  technique:  the  need  to  model  scattering  from  and  propagation  through  penetrable 
rough  interfaces  characterized  by  a  density  jump  (such  as  are  found  at  the  ocean  bottom). 

This  NRL  report  provides  a  comprehensive  treatment  covering  the  entire  body  of  work  performed  on 
this  topic  during  the  last  seven  years.  It  provides  a  new  formal  development  of  the  physical  and 
mathematical  theory  associated  with  the  range-dependent  PE.  The  narrative  below  is  structured  in  a  way 
that  provides  insight  into  the  evolution  of  the  formalism.  It  involves  both  a  deeper  look  at  topics 
introduced  in  previous  papers  [1,  7-8]  and  a  look  at  new  topics. 

After  the  effort  motivation  is  described  in  Section  2,  Section  3  develops  the  tools  that  lie  at  the  core  of 
the  new  approach.  Section  4  introduces  jumps  in  the  sound  speed,  and  looks  at  a  comparatively 
straightforward  phenomenon  associated  with  such  jumps:  the  classical  Lamb  shift,  including  a  small 
correction  term  that  is  very  significant  for  what  it  reveals  about  the  basic  nature  of  the  PE.  Section  5 
introduces  the  density  jump,  and  expands  the  analysis  of  Section  4  to  include  the  density  jump.  Section  6 
leverages  the  insights  gained  in  Sections  3  through  5  to  develop  a  full  understanding  of  the  issues  related 
to  the  PE  in  the  vicinity  of  a  density  j  ump,  while  Section  7  extends  the  basic  technique  to  electromagnetic 
and  elastodynamic  fields.  The  discussion  is  summarized  by  Section  8. 

The  rest  of  this  section  provides  a  detailed  roadmap  of  the  report. 

Section  2.  This  section  aims  to  establish  the  need  for  designing  a  PE  adapted  for  use  near  a  penetrable 
rough  surface,  and  to  explore  its  potential  significance  to  underwater  acoustics.  Section  2. 1  provides  an 
overview  of  the  relevant  aspects  of  the  PE  formalism,  while  Section  2.2  discusses  the  basic  approaches 
used  to  adapt  this  formalism  to  an  interface  where  the  density  jumps.  Section  2.3  examines  previous 
adaptations  of  the  PE  formalism  specifically  tailored  to  the  very  tricky  problem  of  Bragg  scattering3  from 
a  rough  interface  associated  with  a  density  jump,  and  then  identifies  the  shortcomings  that  have  limited 
the  utility  of  these  approaches.  This  state  of  affairs  has  resulted  in  a  dearth  of  adequate  PE  models  for  the 


a  Bragg  scattering  occurs  when  the  surface  has  wavelength-scale  roughness,  and  consequently  scatters  the  field  like  a 
diffraction  grating. 
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very  important  problem  of  roughness-induced  acoustic  penetration  into  the  ocean  bottom.  The  primary 
goal  of  this  work  is,  therefore,  to  design  a  new  PE  that  is  well-suited  for  this  problem. 

Figure  1  provides  a  more  detailed  schematic  overview  of  Section  2.  The  basic  PE  in  the  water  column 
essentially  corresponds  to  a  factorization  of  the  Helmholtz  equation.  However,  when  range  dependence  is 
added,  the  PE  actually  propagates  an  auxiliary  field  that  roughly  corresponds  to  the  square  root  of  the 
downrange  flux.  Many  common  ocean  bottoms  consist  of  sand  and  mud,  which  are  typically  modeled  as 
fluids.  Then,  the  associated  density  jump  is  treated  using  one  of  two  formal  approaches.  The  first  involves 
a  global  change  of  variables,  and  then  replaces  the  interface  with  a  gradual  transition.  The  second 
approach  breaks  the  interface  into  stair  steps,  and  once  again,  an  auxiliary  field  corresponding  roughly  to 
the  square  root  of  the  downrange  flux  is  conserved  at  the  vertical  interfaces.  Because  of  its  success  in 
matching  benchmark  solutions,  the  second  method  is  now  usually  used  to  model  standard  problems  in 
underwater  acoustics.  However,  when  the  interface  is  rough  through  the  wavelength  scale,  the  physics  of 
the  problem  changes.  The  standard  approach  ceases  to  be  practicable,  and  it  is  consequently  preempted  by 
a  host  of  alternate  approaches.  None  of  these  alternatives  is  entirely  satisfactory.  This  report,  therefore, 
provides  a  new  approach  that  is  designed  for  the  rough  interface  problem  and  solves  the  shortcomings  of 
both  the  stair  step  approach  and  the  alternatives  that  have  been  used  for  the  rough  interface  problem. 

Section  3.  The  formal  approach  used  to  develop  the  new  PE  theory  is  examined  here.  The  development 
begins  with  the  introduction  of  the  Foldy-Wouthuysen  (FW)  transformation  in  Section  3.1.  This 
transformation  was  originally  developed  in  order  to  connect  the  full  relativistic  theory  of  the  hydrogen 
atom  to  the  widely  used  non-relativistic  theory  based  on  the  Schrodinger  equation,  and  here  it  is  adapted 
to  the  similar  problem  of  connecting  the  Helmholtz  equation  for  an  acoustic  field  with  the  corresponding 
PE.  Section  3.2  discusses  the  direct  acoustic  equivalent  of  the  atomic  Lamb  shift,  which  occurs  when  a 
cusp  in  the  sound  speed  (i.e.,  a  discontinuity  in  the  sound  speed  gradient)  is  advected  by  a  stochastic 
rough  surface.  The  dominant  component  of  the  classical  Lamb  shift  is  associated  with  roughness-induced 
smearing  of  the  interface  in  the  transverse  direction,  but  it  is  also  shown  that  the  new  terms  generated  by 
the  FW  transformation  produce  a  correction  to  the  Lamb  shift  associated  with  uprange/downrange 
smearing.  These  assertions  are  backed  up  by  the  development  of  a  “toy  model”  of  the  atomic  Lamb  shift 
that  shows  the  connection  between  the  classical  and  the  quantum  mechanical  Lamb  shift.  Our 
consideration  of  the  sound  speed  cusp  along  a  stochastic  rough  surface  is  unique  in  that  it  both  has  a  clear 
precedent  and  also  involves  the  new  contact  potentials  imposed  by  the  FW  transformation  along  a 
penetrable  rough  surface.  Subsequently  in  Section  3.3,  the  systematic  construction  of  a  new  downrange 
stepping  procedure  begins  with  an  examination  of  the  mechanics  of  discretizing  a  PE  in  order  to  create  a 
downrange  stepping  algorithm  near  a  sloped  penetrable  interface.  (This  issue  of  interface  slope  was 
sidestepped  by  the  approaches  discussed  in  Section  2.3).  Section  3.3.1  discusses  the  discrete  form  of  the 
PE,  while  Section  3.3.2  addresses  the  crucial  problem  of  evaluating  the  Hamiltonian  at  the  interface  in  the 
discrete  problem.  It  is  shown  that  higher-order  boundary  conditions  are  a  key  ingredient  of  this  stepping 
procedure.  The  existence  of  such  conditions  has  occasionally  been  noted  in  the  past,  but  no  procedure  for 
determining  (let  alone  exploiting)  them  has  met  with  wide  acceptance.  Section  3.3.3  outlines  the  way  that 
the  FW  transformation  imposes  boundary  conditions  at  an  interface.  These  boundary  conditions  are 
generated  by  contact  potentials,  which  emerge  from  the  FW  procedure.  The  examination  in  Section  3.2  of 
the  contact  potentials  generated  by  the  classical  Lamb  shift  serves  to  lay  the  groundwork  for  the  expanded 
role  that  contact  potentials  play  in  the  context  of  Section  3.3.3. 

Section  4.  This  section  examines  the  issue  of  an  interface,  where  the  sound  speed  itself  (and  not  just  its 
gradient)  is  discontinuous.  At  the  core  of  this  study  once  again  lie  the  two  ways  in  which  interface 
roughness  buffers  the  singularity  at  a  sound  speed  cusp-uprange/downrange  and  transverse  smearing  (i.e., 
the  classical  Lamb  shift)-and  examines  these  phenomena  in  the  context  of  the  sound  speed  jump.  Our 
study  of  these  phenomena  will  provide  a  vehicle  allowing  us  to  further  develop  various  aspects  of 
the  formal  structure  that  were  introduced  in  Section  3.  In  Section  4.1,  a  “quasi-first-order”  theory 
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Fig.  1  —  A  schematic  of  the  context  of  the  results  of  this  report 
in  the  development  of  underwater  acoustics 
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is  considered.  The  term  from  the  FW  transformation  that  is  known  to  be  responsible  for  uprange/ 
downrange  smearing  is  paired  with  the  basic  lowest  order  Hamiltonian.  The  result  is  a  hybrid  between 
first-  and  third-order  PE  theory,  but  the  two  extra  orders  in  the  FW  term  come  from  downrange 
derivatives,  which  are  in  this  context  excluded  from  the  power  counting.  Next,  in  Section  4.2,  the 
resultant  boundary  conditions  along  a  deterministic  interface  are  examined  in  detail.  In  order  to  interpret 
these  conditions,  it  is  necessary  to  consider  how  boundary  conditions  “migrate”  as  the  lead  order  in  the 
Hamiltonian  changes.  Indeed,  we  find  that  the  relationships  between  some  of  the  boundary  conditions  that 
appear  in  the  PE  and  their  corresponding  manifestations  in  the  full  wave  problem  are  often  the  most 
apparent  for  a  PE  of  specific  order,  and  that  the  connections  can  become  quite  obscure  at  other  orders. 
This  finding  will  play  a  pivotal  role  later  in  Section  6.1.  Section  4.3  includes  stochastic  effects,  most 
notably  the  classical  Lamb  shift  induced  by  a  sound  speed  jump.  This  time,  uprange/downrange  smearing 
cancels  out  in  the  stochastic  problem,  leaving  only  transverse  smearing  to  contribute  to  the  Lamb  shift 
associated  with  a  sound  speed  jump.  In  Section  4.4,  the  various  physical  phenomena  examined  in  this 
effort  are  placed  into  a  broader  context. 

Section  5.  The  density  jump  is  introduced  here.  In  Section  5.1,  the  basic  components  of  the  FW 
procedure  are  adapted  to  the  case  where  the  density  jumps,  and  then  Section  5.2  discusses  S -function 
bifurcation,  an  important  new  tool  needed  for  adapting  the  results  to  interfaces  where  the  density  jumps. 
In  Section  5.3,  the  interface  where  the  density  jumps  is  examined  for  the  first  time.  Once  again, 
deterministic  “quasi-first-order”  theory  is  used  to  examine  the  smearing  induced  by  the  new  FW  term. 
Then,  the  stochastic  problem  is  considered  and  the  classical  Lamb  shift  associated  with  a  density  and 
sound  speed  jump  is  obtained.  Transverse  smearing  induced  by  averaging  the  rough  interface  once  again 
dominates,  but  now  with  a  density  jump  present,  the  smearing  induced  by  the  FW  term  survives  the 
averaging  process.  The  examination  of  the  classical  Lamb  shift  concludes  with  a  discussion  of  its  possible 
relevance  to  underwater  acoustics. 

Sections  4  and  5  juxtapose  the  dominant  component  of  the  Lamb  shift,  transverse  smearing  in  the 
stochastic  problem,  with  tilt-induced  smearing  in  the  deterministic  problem  in  order  to  demonstrate  that 
the  PE  buffers  (i.e.,  smears  out)  singularities  in  the  deterministic  problem  in  a  way  that  closely  mimics  the 
buffering  imposed  by  the  stochastic  problem. 

Section  6.  With  the  conclusion  of  our  discussion  of  the  classical  Lamb  shift,  we  finally  have  developed 
the  formal  wherewithal  to  pursue  the  primary  goal  of  this  report:  the  construction  of  a  PE  suitable  for  a 
rough  interface  characterized  by  a  density  jump.  This  issue,  which  is  the  most  pressing  one  from  a 
practical  point  of  view,  is  addressed  in  Section  6.  In  Section  6.1,  a  new  effect  associated  with  a  density 
jump  is  identified,  named  (as  Bragg-scale  vorticity),  and  then  incorporated  into  the  PE  formalism.  Bragg- 
scale  vorticity  occurs  because  along  a  density  jump,  the  fluid  picks  up  an  oscillating  twist  at  the 
wavelength  scale.  Although  Bragg-scale  vorticity  enters  the  acoustic  problem  in  a  way  that  evokes  no 
direct  analogies  from  atomic  physics,  the  mathematical  formalism  built  up  to  this  point  imposes  a  specific 
procedure  for  incorporating  this  effect.  It  emerges  naturally  from  the  high-order  PE  at  an  interface,  where 
the  density  jumps.  A  full  second-order  PE  at  the  interface  is  derived  and  analyzed.  The  subtle  mechanics 
associated  with  the  formalism  is  discussed  as  well.  Once  again,  we  find  that  the  singularity  (in  this  case 
the  density  jump)  has  been  buffered.  This  type  of  buffering  is  more  subtle  than  the  ones  studied  in 
Sections  3.2,  4,  and  5.3,  and  these  previous  results  provide  a  conceptual  groundwork  for  recognizing  and 
understanding  this  phenomenon.  The  new  result  is  then  placed  into  the  context  of  current  PE  techniques 
in  Section  6.2. 

Section  7.  This  section  adapts  the  formal  approach  developed  for  the  acoustic  field  with  varying 
density  to  the  electromagnetic  (Section  7.1)  and  elastodynamic  (Section  7.2)  problems.  The  discussion 
sets  up  the  problem  by  deriving  the  state  space  equation  and  by  taking  a  brief  look  at  the  transformations 
connecting  the  familiar  full-wave  fields  with  the  associated  auxiliary  fields  that  are  propagated  by  the  PE. 
It  turns  out  that  jumps  in  electric  and  magnetic  permeability  (in  electromagnetic  theory)  and  in  the  second 
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Lame  parameter  (in  the  theory  of  elastic  waves)  involve  issues  that  are  similar  to  those  associated  with 
jumps  in  the  density  in  the  acoustic  problem  (i.e.,  the  issues  associated  with  Bragg-scale  vorticity). 

Section  8.  This  wraps  up  the  current  discussion.  The  results  are  summarized  in  Section  8.1,  and  finally 
future  developments  are  discussed  in  Section  8.2. 

2.  BACKGROUND  AND  MOTIVATION 

This  section  examines  the  context  of  this  work.  Section  2. 1  establishes  the  basic  PE  formalism  to  be 
used  in  this  report,  while  in  Section  2.2  examines  the  implications  of  an  interface  where  the  density 
jumps.  Section  2.3  reviews  previous  adaptations  of  the  PE  formalism  to  Bragg  scattering  from  a  rough 
interface  associated  with  a  density  jump,  and  then  identifies  the  shortcomings  that  have  limited  the  utility 
of  these  approaches. 

2.1  The  Parabolic  Equation  (PE) 

2.1.1  The  Significance  of  the  Parabolic  Equation 

The  salient  features  of  the  PE  are  that  it  inherently  selects  out  a  preferred  direction  designated  as  the 
range  and  then  further  stipulates  that  the  range  dependence  of  the  environmental  parameters  is  modest  and 
that  the  propagation  is  mostly  in  the  downrange  direction.  A  more  precise  definition  of  these  conditions 
will  emerge  as  we  examine  the  formalism  below.  For  the  moment,  note  that  this  scenario  is  most  typical 
of  ducted  propagation  -  in  other  words,  it  occurs  in  environments  where  the  propagation  is  by  and  large 
constrained  to  the  vicinity  of  the  downrange  direction  for  a  broad  range  of  initial  conditions.  Two 
examples  commonly  found  are  long-range  propagation  in  the  deep-water  sound  channel  and  shallow 
water  propagation  (for  current  purposes,  the  shallow  water  problem  is  characterized  by  depths  on  the 
order  of  30  to  100  m  and  ranges  in  the  1  to  40  km  range).  The  latter  is  currently  of  more  interest  from  a 
practical  point  of  view,  and  it  also  involves  the  more  challenging  issues  from  a  modeling  perspective.  In 
this  context  the  PE  is  typically  applied  to  the  50  to  5000  Hz  frequency  range,  but  the  results  developed  in 
the  discussion  that  follows  are  not  a  priori  restricted  to  this  range.  The  theoretical  work  developed  here 
therefore  aims  to  broaden  the  techniques  that  are  available  for  modeling  the  propagation  of  acoustic 
waves  in  a  ducted  underwater  environment. 

If  the  features  of  the  ducted  environment  are  all  significantly  larger  than  a  wavelength,  then  the 
geometrical  acoustics  limit  applies,  and  the  propagation  can  be  modeled  by  tracing  rays.  This  method  is 
attractive  because  it  is  fast.  This  advantage  is  particularly  significant  in  time-dependent  problems.  The 
time  imposes  an  extra  dimension,  and  one  must  either  discretize  the  time  domain  or  use  Fourier 
decomposition.  This  complicates  the  problem  and  slows  the  numerical  algorithms  used  to  model  the  field, 
but  ray  tracing  bypasses  this  need.  The  widespread  use  of  ray  tracing  is  further  guaranteed  by  the  fact  that 
it  is  a  very  effective  way  to  model  signal  spread  due  to  multipathing,  which  is  of  particular  interest  to 
those  that  evaluate  a  field’s  utility  as  a  signal  carrier. 

The  utility  of  ray  tracing  is,  however,  limited  by  several  factors.  In  complex  environments,  rays 
proliferate  dramatically,  and  the  bookkeeping  associated  with  this  technique  can  become  prohibitive.  In 
fact,  in  environments  with  significant  range  dependence,  a  ducted  environment  can  exhibit  chaos-like  ray 
proliferation,  and  the  breakdown  of  ray-tracing  method  occurs  on  a  fundamental  level  [9,10].  On  an 
equally  fundamental  level,  ray  tracing  cannot  model  full-wave  effects  associated  with  physical  acoustics. 
These  are  present  when  the  environmental  parameters  vary  on  the  wavelength-scale  (as  often  occurs  in 
acoustics).  For  these  reasons,  in  underwater  acoustics,  ray-tracing  will  often  have  to  be  abandoned  in 
favor  of  full-wave  techniques. 
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In  principle,  ducted  propagation  problems  can  be  solved  using  full-wave  techniques  such  as  numerical 
inversion  or  coupled  modes.  In  such  approaches,  all  points  are  causally  connected  in  the  sense  that  the 
field  at  any  given  point  is  influenced  by  what  happens  to  the  field  at  every  other  point.  In  one  manner  or 
another,  the  formalism  is  ultimately  forced  to  calculate  all  these  mutual  relationships.  In  many 
realistically  complex  environments,  these  techniques  become  so  numerically  intensive  that  they  often  fall 
beyond  the  practical  range  of  even  the  fastest  computers.  Furthermore,  brute  force  calculations  of  this 
type  are  so  divorced  from  physical  insight  that  their  utility  as  diagnostic  tools  is  limited,  and  it  is  often 
difficult  to  separate  numerical  artifact  from  bona  fide  physical  effect. 

The  PE  is  a  one-way  stepping  algorithm  based  on  the  physical  insight  that  in  ducted  propagation  the 
value  of  the  field  at  a  given  point  in  space  can  be  by  and  large  determined  without  knowledge  of  what 
subsequently  happens  to  the  field  downrange  from  that  pointb.  The  scaled-back  causal  structure 
significantly  streamlines  numerical  calculations  based  on  the  PE.  Furthermore,  in  range-dependent 
environments,  the  PE  correctly  models  mode  coupling,  and  strong  mode-coupling  is  precisely  the 
phenomenon  that  leads  to  the  breakdown  of  the  coupled-mode  and  ray-tracing  approaches0.  The  PE  thus 
expresses  the  wave  equation  in  a  representation  that  is  ideally  suited  for  the  computer  modeling  of 
classical  wave  propagation  in  a  ducted  environment*1. 

2.1.2  The  Underlying  Physical  Problem 

This  section  establishes  a  basic  notation  for  our  study  of  the  PE  and  then  uses  it  to  outline  the  physical 
problem  that  is  to  be  approximated  by  the  PE. 

2. 1.2.1  The  Basic  Geometry  of  the  Environment 

Assume  a  Cartesian  coordinate  system  with  the  positive  x-axis  pointing  in  the  downrange  direction  and 
z  the  depth  coordinate  (the  convention  in  this  report  has  the  z-axis  point  upward,  for  example,  away  from 
the  ocean  bottom  and  towards  the  air-sea  interface).  Denote  the  two-dimensional  vector  transverse  to  the 
range  by  RT  =  (y,  z)e.  The  geometry  of  the  ducted  environment  is  shown  in  Fig.  2. 


b  To  see  this,  note  that  Huygens’  principle  recovers  all  the  geometrical  and  physical  optics  (or  acoustics)  associated 
with  forward  propagation,  is  valid  in  the  physical  three-dimensional  space,  and  only  depends  on  the  same  one-way 
causality  inherent  in  the  PE. 

c  The  former  breaks  down  because  solutions  constantly  need  to  be  glued  together,  and  the  latter  because  of  ray 
proliferation. 

d  It  should  also  be  noted  that  the  PE  often  provides  more  physical  intuition  than  do  “brute-force”  simulations  based 
on  full-wave  theory.  That  is  ultimately  the  reason  why  the  Schrodinger  equation  is  still  widely  taught  and  used  to 
study  the  hydrogen  atom,  even  though  simulations  based  on  numerical  solutions  to  relativistic  quantum  mechanics 
are  in  principle  available. 

e  This  coordinate  system  is  preferable  for  the  formal  work  pursued  in  this  report,  although  cylindrical  coordinates  are 
more  appropriate  for  many  physical  applications.  The  result  in  cylindrical  coordinates  is  mathematically  identical  to 
that  obtained  when  the  Cartesian  coordinate  system  is  used,  provided  that  the  radial  coordinate  r  is  substituted  for  x, 
the  wave  function  is  rescaled  by  a  factor  1/Vr  ,  and  nonpropagating  terms  proportional  to  1  Jr  are  dropped. 
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Ducted  environment 

Fig.  2  —  The  geometry  of  a  typical  ducted  environment 


2.1.2.2  The  Related  Full-Wave  Equation 


The  acoustic  pressure  field  P(x,RT,t )  is  assumed  to  propagate  according  to  the  acoustic  wave 
equation  (see  Ref.  11) 


pV- 


f  \  -  ' 

-VP 

p  J 


-T|rf  =  0 


through  a  time-independent  environment.  The  three-dimensional  gradient  is  given  by  Vs(3/3x,Vr), 
where  VT  = (d/dy,d/dz)  is  the  two-dimensional  gradient  in  the  transverse  direction.  In  general,  the 
density  p  and  the  sound  speed  c  can  depend  on  the  coordinates  (x,RT) .  Since  the  problem  is  time- 
independent,  each  frequency  co  can  be  considered  separately,  and  so  the  pressure  is  given  by  the  integral 
over  co  of  Pw(x,RTj)  =  Re|^A(v,Pr J  ,  where  A  is  a  complex  field  obeying  the  equation 


pV 


^ 

— VA 
P 


co  .  . 

+  — A-0 . 

c 


(1) 


By  time  averaging  over  the  period  of  the  wave,  observable  physical  quantities  such  as  the  downrange 
energy  flux  and  the  scattering  cross-section  can  be  calculated  directly  from  the  complex  field  A. 

Note  that  c  and  p  depend  on  the  two-dimensional  transverse  coordinate  Rr  .  In  many  instances  below, 
we  consider  the  two-dimensional  problem  where  the  v-coordinatc  falls  out  and  the  depth  z  is  the  only 
transverse  coordinate.  To  be  specific,  general  formal  development  will  apply  both  to  the  full  three- 
dimensional  x-  y  -  z  -space  and  to  two-dimensional  x  —  z  space.  However,  in  the  current  report,  once 
interfaces  are  explicitly  introduced  into  the  formalism,  only  one-dimensional  interfaces  z  =  f(x ) 
embedded  in  two-dimensional  x  —  z  space  are  considered.  The  generalization  to  the  full  three- 
dimensional  space  is  very  briefly  examined  at  the  end  of  Section  3.3.1  and  again  in  Section  3.3.3. 
Although  this  treatment  is  supplemented  by  Appendix  G,  the  full  development  of  a  formalism  good  for 
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two-dimensional  surfaces  z  =  f(x,y )  embedded  in  three-dimensional  x—y  —  z  space  is  deferred  to 
future  work.  Currently  available  applications  of  the  PE  formalism  also  typically  apply  to  the  two- 
dimensional  problem  (x  —  z  -space). 

2.1.3  A  First  Look  at  the  PE 


Let  us  now  introduce  the  PE  with  an  intuitive  and  frequently  used  derivation  of  its  most  basic  form  and 
then  briefly  examine  how  the  result  is  used  by  modelers.  The  effects  of  range-dependence  are  examined 
next,  and  this  is  followed  by  a  discussion  of  the  fundamental  limits  inherent  to  the  kinds  of  PEs  developed 
by  the  approaches  pursued  in  this  report. 

2.1.3.1  A  “Quick  And  Dirty”  Derivation 


For  the  moment  assuming  that  the  density  is  constant.  A  obeys  the  Helmholtz  equation: 

d2A  i  , 

—  +  V2TA  +  k2(x,RT)A  =  0. 
dx 


(2) 


The  wave  number  k  is  the  frequency  CO  divided  by  the  sound  speed  c.  In  the  traditional  derivation  of  the 
PE.  the  range  dependence  of  the  sound  speed  profile  is  assumed  to  be  sufficiently  modest  that  it  can  be 
ignored  (i.e.,  temporarily  assume  that  k  =  co/c  (R,  )  =  k(  Rr ) ).  It  is  then  possible  to  unambiguously  factor 
the  Helmholtz  equation: 


f  ■ d 

~l - h  y 

JVl  +  k2 

[  i—  +  M  +  k2 

^  dx 

V  T 

y 

[dx  V  r  ) 

(3) 


Since  k  is  range-independent,  id/dx  and  +k2  commute*  and  the  cross-terms  cancel  so  that  the 

product  above  indeed  reproduces  the  Helmholtz  operator.  Furthermore,  the  order  of  the  factors  in  the 
product  does  not  matter,  and  thus  A  is  a  solution  to  the  Helmholtz  equation  whenever  either  factor 
operating  on  A  is  zero.  Typically,  the  branch  that  corresponds  to  downrange  propagation  is  chosen,  and 
the  terms  are  reorganized  to  create  an  operator  “  2/1/ k0  ”  that  is  small  in  some  sense8: 


where 


-*'f^-  =  VVr  +kl A  =  sjy2r-{l-n2)k;i  +k;A  =  k  fl  +  ^-A, 
ox  \j  ka 


k  =  nk„  for  reference  k 


(4) 


X 


(5) 


and 


1  —  n2 
2 


Complying  with  usage  standard  in  Quantum  Mechanics,  differential  operators  (such  as  X)  embedded 
inside  functions  are  understood  to  be  the  Taylor  series  expansions  of  the  function  with  the  operator  acting 


*  Two  operators  a  and  (3  commute  if  [a.  f3\  =  aj3  -  j3a  =  0  .  [a,  fl\  is  called  the  commutator  of  a  and  (3.  We 
will  also  make  frequency  use  of  the  anti-commutator  {a.  /3}  =  af3  +  f3a .  If  [a.  f3]  =  0  ,  then  the  two  operators  are 
said  to  anti-commute. 

g  Typically  small  means  “corresponds  to  small  (dimensionless)  eigenvalues  in  the  range  independent  case,”  but  this 
has  to  be  generalized  when  the  density  varies. 
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as  the  expansion  parameter11.  Specifically,  the  operator  *Jl  +  22/k0  will  always  be  understood  to  be 
synonymous  with  the  Taylor  series  expansion  obtained  by  treating  2A/k0  as  an  expansion  parameter.  The 
finite  expansion  of  k0^Jl  +  2A/k0  is  known  as  the  Hamiltonian  H.  For  example,  in  Section  6.2.2,  it  will  be 
convenient  and  illuminating  to  use  the  shorthand  whereby  functions  of  the  operator  k0  J 1  +  2 A/k0  are 
written  as  functions  of  —id/ dx  (following  Eq.  (4)). 

The  PE  derived  from  the  Helmholtz  equation  closely  parallels  the  Schrodinger  equation  derived  from 
the  Klein-Gordon  equation1.  The  first-order  version  of  the  PE  was  applied  to  electromagnetic  fields  by 
Leontovitch  and  Fock  [12],  and  it  was  later  applied  to  problems  in  geophysics  by  Claerbout  [13]. 
Claerbout  introduced  the  full  square  root  operator  to  the  classical  problem  in  1971  [14].  Tappert  brought 
the  PE  formalism  to  underwater  acoustics  in  1977  [15]. 

2. 1.3.2  The  Numerical  Implementation 

When  the  PE  is  used  in  numerical  calculations,  it  is  discretized  and  transformed  into  a  stepping 
algorithm  as  follows.  Consider  some  field  %  that  obeys  a  generic  PE  —i  d%/dx  =  H /  .  The  Hamiltonian 
H  is  a  function  of  the  transverse  derivative  Vr .  The  leading  order  term  is  proportional  to  .  The 
equation  is  discretized  to  first-order  in  the  step  size  Ax  using  the  Mean  Value  Theorem.  This  constitutes 
the  basic  stepping  algorithm.  In  a  practical  application,  a  pair  of  additional  manipulations  would  typically 
be  superimposed  on  top  of  this  basic  algorithm.  The  Crank-Nicholson  procedure  is  usually  used  to  make 
the  algorithm  manifestly  unitary  (even  for  finite  step  size)  [16],  and  often  (for  a  variety  of  technical 
reasons,  including  most  notably  to  speed  convergence),  H  would  also  be  broken  into  fractions  using  a 
Pade  approximation  (pioneered  by  M.  Collins  -  see,  for  example,  Refs.  17  or  18;  more  precise 
terminology  would  be  to  call  this  a  rational  function  approximation^.  Since  these  operations  are  imposed 
in  separate  formal  steps  after  the  basic  stepping  algorithm  has  been  fully  specified,  it  is  not  appropriate  to 
consider  them  in  the  formal  development  being  pursued  in  this  report.  Only  the  basic  stepping  algorithm 
is  therefore  discussed  here.  Subsequent  imposition  of  the  Crank-Nicholson  algorithm  should  be  fairly 
straightforward  since  it  involves  the  same  Hamiltonian,  but  adaptation  of  the  Pade  approximation  to  the 
new  basic  formalism  will  be  a  nontrivial  topic  for  future  research. 


h  Differential  operators  embedded  inside  functions  are  more  generally  called  pseudo-differential  operators,  but  this 
terminology  also  incorporates  circumstances  where  the  Taylor  series  expansion  does  not  converge. 

1  In  this  case,  (using  units  where  the  Planck's  constant  fi  and  the  speed  of  light  are  set  equal  to  one)  the  reference 
wave  number  k0  becomes  the  rest  mass,  time  takes  the  place  of  the  range  X ,  the  transverse  space  is  physical 
x  —  y  —  z  -space,  and  the  Minkowski  metric  of  four-dimensional  space  time  imposes  an  extra  minus  sign  in  front  of 
the  transverse  gradient:  V2  =  Vr  •  Vr  => -V  •  V  = -V2 .  Second  and  higher  orders  in  2A/ka  provide  relativistic 
corrections.  Usually,  /J  (now  some  dimensionless  measure  of  a  scalar  potential)  is  set  to  0  in  this  case.  Scalar 
potentials  affecting  bosons  are  not  typically  introduced  directly  into  the  scalar  Klein-Gordon  equation.  In  fact,  the 
physically  realistic  version  of  that  problem  would  involve  subtleties  not  relevant  to  the  discussion  here. 

JThe  Crank-Nicholson  procedure  is  necessary  because  we  have  discretized  with  finite  steps: 
—iAA  =(UA)Ax  +  O | f Ay ) 2 j .  At  finite  order  in  Ax,  this  operation  is  unitary  only  in  the  limit  as  Ay— >0.  The 
Crank-Nicholson  procedure  replaces  this  with  a  stepping  prescription  that  is  the  same  to  o|(Ay)~  j ,  but  preserves 
unitarity  exactly  (even  for  finite  Ay).  For  example:  Anew  =  (] +i(Ax)H)Aahl  — >  Anew  =  [(\+i(Ax)H /2]/ 
(1  —i(Ax)H  /  2)]Aold  or  (l  — i  (Ax)H /2)  A^  =  (l  +  z  (Ay)  ///2)  Aold .  When  the  problem  is  formulated  in  this  form,  it  is 
possible  to  solve  for  Anew  using  standard  techniques  in  numerical  analysis.  The  Pade  approximation  represents  yet 
another  step  beyond  this.  Now  H  =  p/q  and  we  have  (q  —  i (Ax')  p/2)  Anew  =  (q  +  i  (Ay)  p/2)  Aold .  This  is  a 
convenient  way  to  obtain  H  to  high  orders  in  22/ k0  . 
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2.1.3.3  Range  Dependence 

The  formalism  related  to  the  PE  can  be  upgraded  to  incorporate  the  effects  of  range  dependencek  via  a 
two-step  process.  The  first  step  is  intuitively  straightforward,  while  the  second  step  is  subtler1. 

The  procedure  begins  with  the  observation  that  in  a  range-dependent  environment,  a  factorization  of 
the  type  given  in  Eq.  (3)  generates  an  unwanted  commutator  term  in  addition  to  the  terms  in  the 
Helmholtz  Eq.  (2).  The  first  step  for  incorporating  this  artifact  of  range  dependence  into  the  PE  formalism 
occurs  when  we  introduce  order-by-order  correction  terms  into  the  Hamiltonian  to  cancel  the  unwanted 
commutator  terms  (maintaining  the  symmetry  between  the  uprange  and  downrange  classes  of  solutions). 
Note  that  every  time  we  add  a  new  term  to  the  Hamiltonian,  we  end  up  generating  a  new  unwanted 
commutator  term  of  higher  order,  so  the  procedure  is  iterative.  A  term  proportional  to  ifl 

( =  idjLl/dx  =  [/%,.//]  °=  \i%x,kQ  +  Z] ;  in  this  report  the  dot  above  a  variable  always  denotes  the  downrange 
derivative  d/dx)  is  added  to  perform  the  first  iteration  on  the  0(/l)  Hamiltonian.  To  O (/LA)  this  term 
indeed  cancels  the  commutator  term  produced  by  a  factorization  of  the  type  given  in  Eq.  (3)  above.  A 
lucid  and  well-developed  examination  of  the  effects  of  such  a  term  is  given  in  Schurman  et  al.  [19].  Some 
interesting  results  are  obtained,  for  example,  for  the  case  of  fronts,  but  there  is  a  potential  problem  with 
this  formulation  that  can  cause  trouble  if  this  procedure  is  applied  to  the  general  range -dependent 
problem.  The  additional  term  induced  by  the  range  dependence  is  non-Hermitian,  so  the  integral  over 

transverse  space  of  the  magnitude  of  the  field  | dRr  |A|"  is  no  longer  conserved.  Lacking  a  conservation 
law,  there  is  nothing  to  enforce  stability  in  numerical  calculations. 

The  second  step  of  the  procedure  for  incorporating  the  effects  of  range  dependence  sidesteps  this 
problem  by  noting  that  the  non-Hermitian  terms  in  the  Hamiltonian  do  not  accumulate  with  downrange 
propagation,  but  only  operate  at  the  endpoints.  Taking  a  cue  from  earlier  work  by  Bremmer  [20],  Tappert 
had  already  by  1977  incorporated  this  insight  to  form  an  optimal  range -dependent  formalism  for  the  PE 
[15,  21-22]  (in  Ref.  15,  see  especially  pp.  278-279,  and  in  Ref.  21,  note  the  discussion  following  Eq. 
(11)).  This  formalism  can  be  derived  from  the  first  step  described  above  using  the  following  logic.  Since 
ifl  is  a  perfect  range -derivative  in  the  sense  that 

..  .  d/f  .  ;  r  «-|  .  d  A* 

ifl  =  1  —  =  i A  +  [fz  ,/t]  =  / - , 

dx  dx 


(where  recall  that  H  is  the  expansion  in  2T//c0  of  +  we  can  immediately  recognize  that  this 

term  only  operates  at  the  endpoints,  and  so  it  is  not  an  inherent  part  of  the  propagation  itself.  It  can 
therefore  be  removed  from  the  propagation  equation,  and  applied  as  a  transformation  at  the  endpoints. 
Generalizing  to  include  higher  orders,  this  class  of  effects  can  be  incorporated  into  the  theory  using  the 
following  prescription:  take  out  the  WKB  amplitude  at  the  initial  range  to  form  the  auxiliary  field 

^  =  (l  +  [2/l/k0])  4  A,  propagate  the  field  % ,  and  put  the  WKB  amplitude  back  in  after  propagation  is 

completed  A  =  (1  + [2A/k0])  ^  X  ■  (A  nice  derivation  of  this  result  for  the  one -dimensional  string 

displacement  problem  can  be  found  in  Ref.  23.)  The  Hamiltonian  used  to  propagate  this  auxiliary  field  is 
Hermitian,  which  implies  that  the  integral  across  transverse  space  of  the  magnitude  squared  of  the  field 


k  This  means  that  the  sound  speed  now  depends  on  the  range:  c  =  c(x,  Rr ) ,  and  so  (of c  =  k  =  k( x,  RT ) ,  but  for  the 
moment  the  density  p  still  remains  everywhere  the  same. 

1  The  two-step  argument  outlined  here  is  provided  primarily  for  historical  context.  While  the  basic  results  that 
emerge  from  this  process  will  in  one  way  or  another  reemerge  from  all  the  techniques  we  will  subsequently  be 
examining,  they  will  be  the  product  of  very  different  lines  of  reasoning. 
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X  is  conserved.  Since  is  to  within  an  integration  by  parts  the  downrange  flux  (see  Appendix  A), 

this  conservation  rule  also  guarantees  energy  conservation.  Note  that  Hermiticity  of  the  Hamiltonian  also 
produces  a  propagation  equation  that  is  numerically  stable.  Since  energy  conservation  is  so  intimately 
connected  to  numerical  stability,  it  is  a  very  desirable  attribute  to  find  in  a  PE. 

2. 1.3.4  Phenomena  Not  Addressed  by  this  Formalism 

It  is  important  to  bear  in  mind  that  this  formalism  is  designed  to  model  downrange  propagation.  It 
ignores  coupling  between  downrange  and  uprange  propagation  (backscatter).  Furthermore,  by  considering 
2/1/ kQ  as  a  small  parameter,  we  exclude  an  entire  class  of  solutions  to  the  Helmholtz  equation  associated 
with  purely  transverse  energy  flow  (evanescent  solutions;  for  more  on  this  phenomenon,  see  the  recent 
work  by  Fishman  and  collaborators  [24,  25]  and  a  brief  discussion  in  the  next  to  last  paragraph  of 
Appendix  A).  Both  backscatter  and  evanescent  modes  are  coupled  to  the  downrange  propagating 
solutions  by  terms  that  go  to  zero  faster  than  any  finite  order  of  the  PE  expansion  in  the  limit  as  the  range 
dependence  goes  to  zero.  Specifically,  this  type  of  coupling  comes  from  a  physical  process  known  as 
“pair-production”  and  the  probability  of  pair-production  occurring  in  a  given  unit  volume  goes  to  zero  as 
exp  constant/ (see  Appendix  B  for  more  on  this  topic).  The  coupling  is  quite  weak  in  many 
typical  underwater  propagation  problems,  and  in  such  cases  it  can  safely  be  ignored. 

2.2  The  Ocean  Bottom:  Introducing  a  Density  Jump 

2.2.1  The  Change  of  Variable  Technique 

A  premier  example  of  an  effort  based  on  the  PE  to  model  shallow  water  ducted  propagation  is  given  in 
a  recent  article  by  Rouseff  and  Ewart  [26].  A  series  of  simulations  are  conducted.  At  first,  the  bottom,  air- 
sea  interface  and  depth-dependent  sound  speed  profile  are  all  smooth  —  in  other  words,  they  are  not 
functions  of  the  range.  Then  random  roughness  is  added  to  the  air-sea  interface  and  bottom  (e.g.,  see  Figs. 
1  and  3  of  the  reference).  There  is  also  a  gentle  slope  to  the  bottom  (a  slope  of  roughly  4  m/km  or  an 
angle  of  about  0.2°),  but  this  range  dependence  is  very  small  compared  with  stochastic  roughness  and  it  is 
neglected  for  the  purposes  at  hand  (both  in  Ref.  26  and  below  in  this  report).  The  simulation  by  Rouseff 
and  Ewart  confirms  the  intuitive  result  that  in  the  absence  of  significant  range  dependence,  mode 
stripping  eliminates  bottom-penetrating  modes,  and  the  field  is  confined  to  the  water  column.  The 
numerical  simulation  also  plausibly  predicts  that  with  the  introduction  of  multiscale  range  dependence, 
energy  is  pumped  back  into  bottom-penetrating  modes,  and  there  is  enhanced  bottom  penetration.  This 
kind  of  bottom  penetration  is  currently  the  focus  of  active  research  [27-30]  and  it  has  potential  relevance 
to  practical  applications.  For  example,  an  acoustic  field  that  penetrates  the  bottom  can  scatter  from  buried 
objects,  and  so  it  can  in  principle  be  exploited  to  remotely  image  these  objects.  However,  in  order  to 
extract  such  information  from  the  field,  it  is  necessary  to  accurately  model  its  behavior  as  it  traverses  the 
range-dependent  environment  on  the  way  to  and  from  the  object  being  imaged.  This  accuracy  must  not 
only  be  qualitatively  correct,  but  also  quantitatively  accurate. 

As  the  field  propagates  downrange,  three  multiscale  stochastic  processes  continuously  pump  energy 
back  into  the  bottom-penetrating  modes  (i.e.,  modes  such  that  for  one  reason  or  another  the  grazing  angle 
is  effectively  greater  than  the  critical  angle): 

•  Roughness  of  the  air-sea  interface. 

•  Advection  of  the  water  column  by  internal  waves  and  turbulence  (Rousseff  and  Ewart  do  not 
pursue  this  in  the  referenced  paper,  but  the  phenomenon  exists  in  the  real  ocean,  and  could  be 
added  to  such  a  simulation  as  well.) 

•  Roughness  of  the  ocean  bottom. 
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The  interaction  of  the  field  with  the  impenetrable  rough  air-sea  interface  is  well  described  using  a 
theory  by  Tappert  and  Nghiem-Phu  [31],  and  advection  of  the  layers  in  the  water  column  by  internal 
waves  and  turbulence  can  be  adequately  modeled  using  the  basic  PE  described  above. 

The  treatment  of  the  rough  bottom,  however,  requires  further  scrutiny.  This  problem  is  challenging 
because  the  density  jumps  at  the  ocean  bottom.  The  classic  PE  technique  outlined  in  Section  2.1  assumes 
that  the  density  is  everywhere  uniform,  but  at  the  ocean  bottom  the  density  jump  is  significant.  The 
density  may  jump  by  as  much  as  a  factor  of  2,  while  the  sound  speed  jump  is  typically  much  more  modest 
(often  just  a  few  percent).  Furthermore,  we  know  from  rough  surface  scattering  theory  that  at  a  two-fluid 
interface,  the  density  p  formally  plays  just  as  prominent  a  role  in  the  formulas  for  the  reflected  and 
transmitted  scattering  amplitudes  as  does  the  sound  speed  (e.g.,  see  Appendix  C  of  Ref.  32).  Therefore,  to 
properly  model  scattering  from  the  ocean  bottom,  the  PE  technique  must  be  modified  to  allow  the  density 
to  change. 

Reference  26  employs  a  technique  by  Tappert  (see  pp.  262  through  264  of  Ref.  15)  to  incorporate 
density  variation  into  the  PE  formalism.  The  auxiliary  field  defined  by  u  =  A / -Jp  obeys  a  Helmholtz 
equation  with  an  effective  index  of  refraction,  and  this  Helmholtz  equation  can  be  converted  to  a  PE  (p 
is  the  density  as  a  function  of  the  coordinates).  However,  in  the  limit  as  the  density  dependence  acquires  a 
step,  the  effective  index  of  refraction  acquires  S  -functions,  and  the  following  ad  hoc  fix  must  be 
imposed:  the  step  is  replaced  with  a  gradual  transition.  The  PE  that  is  produced  by  this  technique  will 
henceforth  be  referred  to  as  the  Change  of  Variable  (COV)  PE. 

2.2.2  The  “Stair  Step”  PE 

As  attested  to  by  Ref.  26,  the  COV  PE  is  still  used  for  specialized  problems  such  as  scattering  from 
rough  surfaces,  but  for  most  standard  problems  in  underwater  acoustics,  it  has  been  supplemented  by  an 
alternate  technique  by  Collins  and  Westwood  [18].  In  this  approach,  the  interface  is  approximated  by  stair 
steps  in  order  to  the  exploit  the  fact  that  it  is  possible  to  directly  take  the  square  root  of  the  range- 
independent  problem  (even  when  the  density  varies  as  a  function  of  a  transverse  coordinate  such  as  the 
depth).  When  the  interface  steps  vertically,  energy  conservation  is  explicitly  forced  (by  demanding 
continuity  of  the  downrange  energy  flux).  This  much  of  the  procedure  was  concurrently  proposed  by 
Porter  et  al.  [33],  but  Ref.  18  perfects  the  technique  with  an  additional  ingredient.  Forcing  energy 
conservation  at  a  vertical  interface  leads  to  a  discontinuity  of  the  pressure  field,  while  the  boundary 
conditions  along  the  horizontal  interface  demand  pressure  continuity.  This  leads  to  an  unphysical 
discontinuity  in  the  pressure  at  the  corners  of  the  stair  step,  which  in  turn  spawns  Gibbs  oscillations. 
These  are  eliminated  using  complex  Pade  coefficients™.  The  imaginary  parts  of  the  Pade  coefficients 
effectively  introduce  (in  this  case  unphysical)  evanescent  modes  that  restore  field  continuity  without 
introducing  a  superfluous  downrange  flux.  Below  in  this  report,  the  approach  by  Collins  and  Westwood  is 
referred  to  as  the  Stair  Step  PE. 


The  current  prominence  of  the  Stair  Step  PE  dates  back  to  its  record  of  accuracy  established  during  the 
late  stages  of  a  program  that  during  the  late  1980’s  and  early  1990’s  systematically  benchmarked  various 
PE  methods  [34-36].  The  Stair  Step  PE  generally  produced  the  most  accurate  results  [36],  and  it  has  been 
dominant  ever  since. 


m  Strictly  speaking,  these  coefficients  actually  appear  in  rational  function  approximations  to  the  Hamiltonian 
generated  by  the  PE.  These  rational  functions  are  not  designed  to  formally  match  the  Taylor  series  expansion  order 
by  order  (a  type  of  least  squares  fit  is  generally  used  to  generate  the  optimal  coefficients  in  the  rational  functions). 
As  Frank  Henyey  has  pointed  out  that  (personal  communication),  these  rational  functions  are  thus  not  strictly 
speaking  true  Pade  approximations,  but  the  terminology  has  become  so  ubiquitous  in  this  context  that  we  will  use  it 
here  as  well. 
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2.3  Adapting  the  PE  to  the  Rough  Surface  Scattering  Problem 

2.3.1  Previous  Efforts  to  Design  a  PE  Optimized  for  Rough  Surface  Scattering 

2.3.1.1  Bragg  Scattering  Taxes  the  Stair  Step  PE 

The  Stair  Step  PE  can  become  somewhat  unwieldy  if  the  imposition  of  the  energy  conservation 
condition  and  the  accompanying  introduction  of  evanescent  waves  have  to  be  done  very  often.  As  a 
consequence,  although  the  state-of-the-art  PE  (Stair  Step  PE)  has  been  used  to  model  a  wide  range  of 
problems,  the  issue  of  Bragg  scattering  from  penetrable  interfaces  has  typically  been  explored  using 
alternate  techniques. 

In  fact,  while  PE  techniques  that  apply  to  rough  surface  scattering  from  impenetrable  boundaries  have 
been  adequately  benchmarked  [37,  38],  the  first  effort  to  benchmark  the  Stair  Step  PE  applied  to  rough 
penetrable  boundaries  is  only  now  being  pursued  by  Thorsos  (APL-UW).  This  is  significant  for  several 
reasons: 

1.  Bragg  scattering  is  phase  sensitive  in  a  way  that  typical  benchmark  problems  are  not.  The 
typical  benchmark  problem  involves  a  wedge  with  a  tilted  interface  separating  two  media  that 
have  different  sound  speed  and  density.  Since  the  slope  and  curvature  are  the  same  along  the 
entire  interface,  the  effects  of  introducing  an  overall  phase  shift  would  be  invisible.  On  the 
other  hand,  the  Bragg  scattering  that  is  so  characteristic  of  scattering  from  multiscale  rough 
surfaces  depends  on  a  delicate  interplay  of  the  relative  phases  of  the  field  scattered  from  nearby 
points  on  the  interface.  Without  accurate  information  about  the  relative  phase  of  the  different 
components  of  the  field,  it  is  impossible  to  model  Bragg  scatter.  Furthermore,  in  ducted 
propagation,  errors  accumulate  with  the  range.  For  this  reason,  we  need  a  rigorous  theory  that 
unambiguously  imposes  a  unique  phase. 

2.  The  Stair  Step  PE  implicitly  makes  a  number  of  ad  hoc  choices,  and  so  one  cannot  a  priori 
assume  that  it  will  produce  the  one  uniquely  correct  value  of  the  phase  for  the  field  generated  at 
a  given  point  on  an  interface.  Let  us  consider  several  arbitrary  choices  made  by  the  Stair  Step 
PE.  The  energy  conservation  condition  coupled  with  behavior  of  the  field  along  horizontal 
interfaces  forces  the  use  of  evanescent  waves  to  eliminate  Gibbs  oscillations.  There  is  some 
arbitrariness  in  how  the  Gibbs  oscillations  are  removed,  and  the  actual  choice  constitutes  one  ad 
hoc  imposition  that  can  only  be  validated  by  benchmarking.  Similarily,  the  stair  steps  are 
artificial,  and  energy  need  not  be  independently  conserved  on  the  treads  and  risers.  The 
assumption  that  it  is  constitutes  another  ad  hoc  imposition. 

3.  The  Stair  Step  PE  leaves  out  parts  of  both  the  physics  and  mathematical  requirements  related  to 
the  PE  problem.  For  example,  as  discussed  in  Section  3.3.3,  the  full  PE  has  higher-order 
derivatives  right  up  to  the  interface,  and  these  carry  extra  boundary  conditions  that  are  not 
specified  by  the  Stair  Step  PE  in  its  present  form.  Furthermore,  although  the  Stair  Step  PE,  like 
all  PE  methods,  is  sensitive  to  physical  acoustics  effects  like  diffraction,  which  in  turn  depend 
on  the  curvature  of  the  surface  over  an  extended  distance,  it  does  not  allow  for  effects  that  are 
sensitive  to  the  intrinsic  local  curvature  (cf.  polaritons  in  electromagnetic  theory). 

All  these  factors  can  play  an  important  role  in  Bragg  scattering,  particularly  within  a  ducted 
environment,  where  errors  accumulate,  even  though  the  same  factors  are  insignificant  in  the  benchmark 
problems  that  have  given  the  Stair  Step  PE  its  current  dominance. 

Thus,  it  is  premature  to  assume  that  the  success  of  the  Stair  Step  PE  in  less  challenging  problems  is  a 
reliable  indication  of  the  approach’s  capabilities  regarding  the  Bragg  scattering  problem.  The  Stair  Step 
PE  cannot  be  used  with  confidence  to  model  Bragg  scattering  until  Eric  Thorsos’  benchmarking  is 
complete,  and  its  validity  is  fully  established  in  this  context. 
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2.3.1.2  Existing  Alternatives  More  Appropriate  to  Rough  Surfaces  than  the  Stair  Step  PE 

Ideally,  an  alternate  formulation  that  does  not  rely  on  ad  hoc  arguments  in  its  development  would 
improve  on  the  Stair  Step  PE  in  two  ways: 

1 .  Being  a  systematic  theory,  it  would  be  improvable  by  adding  higher  orders  until  its  predictions 
benchmark  to  the  required  tolerance. 

2.  It  is  also  reasonable  to  hope  that  a  theory  constructed  from  fundamental  principles  will  pose  the 
problem  in  its  irreducible  form,  and  consequently  be  easier  to  implement. 

For  the  purpose  of  examining  scattering  from  penetrable  rough  surfaces,  Collins  (with  collaborators) 
has  developed  two  approaches  that  are  alternatives  to  the  Stair  Step  PE.  Both  of  these  represent  progress 
toward  the  goals  listed  above.  The  first  is  designed  so  that  its  predictions  can  be  made  to  agree  with  a 
given  result  to  a  required  tolerance,  and  is  fairly  easy  to  implement.  The  second  approach  returns  to  the 
fundamental  assumptions  behind  the  Stair  Step  PE,  and  recasts  the  problem  so  that  ad  hoc  mathematical 
patches  are  no  longer  necessary. 

The  first  approach,  developed  by  Collins  and  Chin-Bing,  actually  predates  the  Stair  Step  PE  [39].  It 
consists  of  a  phenomenological  theory  applicable  to  the  stochastic  problem.  Along  the  flat  average 
surface,  roughness-induced  changes  in  the  phase  and  amplitude  of  the  field  are  modeled  by  effective 
boundary  conditions  that  correspond  to  angle -dependent  complex  transmission  and  reflection  coefficients. 
As  with  all  phenomenological  methods,  previously  obtained  results  are  reliably  reproduced,  but  the 
technique  is  not  designed  to  accurately  predict  the  general  problem. 

In  the  second  alternative  to  Stair  Step  PE,  Collins  and  Evans  replace  the  rough  surface  with  a 
crenellated  surface  (resembling  battlements  on  a  medieval  castle)  [40].  These  features  are  a  little  bigger 
than  a  wavelength.  A  PE  is  used  to  step  whenever  the  interface  is  locally  flat.  At  vertical  interfaces,  the 
full  Helmholtz  boundary  conditions  including  backscatter  are  imposed.  (These  boundary  conditions  are 
called  the  “single-scatter”  boundary  conditions.)  Although  the  problem  chosen  is  modeled  accurately,  this 
scenario  is  quite  different  from  the  rough  surface  scattering  scenario.  The  most  important  difference  is 
that  a  rough  surface  typically  involves  tilted  interfaces  (with  slope  <  45°)  that  produce  only  subtle 
curvature-induced  backscatter,  while  the  vertical  interfaces  found  on  a  crenellated  surface  naturally  tend 
to  backscatter  strongly. 

This  distinction  is  so  fundamental  that  it  continues  to  cause  trouble  even  if  more  sophisticated 
implementations  of  the  single-scatter  technique  are  constructed.  In  principle,  the  basic  approach  could  be 
applied  to  interfaces  that  are  continuous  functions  of  the  range.  The  surface  would  then  be  broken  into 
stair  steps,  and  the  PE  would  be  used  to  step  downrange  when  the  surface  is  horizontal,  while  the  single¬ 
scatter  boundary  conditions  would  be  applied  at  the  vertical  interfaces.  The  problem  begins  to  appear 
when  we  note  that  it  is  possible  to  design  scenarios  where  even  a  correct  solution  to  the  full  Helmholtz 
equation  does  not  backscatter.  For  example,  consider  a  field  incident  on  a  surface  with  tilt  but  no 
curvature,  and  assume  that  the  grazing  angles  of  the  incoming  spectral  components  of  the  field  and  the 
surface  tilt  are  all  sufficiently  small  that  the  reflected  and  transmitted  wavevectors  all  point  downrange. 
There  is  obviously  no  backscattering  in  this  case.  On  the  other  hand,  a  calculation  that  discretizes  a 
surface  to  form  stair  steps  and  additionally  imposes  the  full  Helmholtz-equation  boundary  conditions  on 
the  vertical  interfaces,  will  continue  to  predict  backscattering  from  these  vertical  interfaces  deep  into  the 
physical  acoustics  limit.  It  would  be  necessary  to  discretize  the  interface  to  a  very  fine  (much  smaller  than 
one  wavelength)  resolution  before  backscatter  would  finally  fade  away.  A  numerical  solution  based  on 
this  formalism  would  be  much  more  resource-intensive  than  necessary,  and  it  would  involve  one-way 
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stepping  without  a  built-in  conservation  law  and  so  it  could  be  numerically  unstable  (recalling  the 
relationship  between  energy  conservation  and  stability  discussed  in  Section  2.1)n. 

2.3.2  Towards  A  New  Approach 

2.3.2.1  The  Physical  Significance  of  Conserving  Energy  in  Downrange  Propagation 

Scattering  from  penetrable  quasi-planar  rough  interfaces  typically  involves  tilts,  grazing  angles,  and 
environmental  parameters  that  are  compatible  with  the  scenario  examined  in  the  previous  paragraph 
(aside  from  surface  curvature,  which  induces  a  relatively  weak  type  of  backscatter).  Thus,  for  the  purpose 
of  calculating  the  scattering  from  a  typical  rough  surface,  an  energy-conserving  PE  would  have  an 
important  advantage  over  the  PE  based  on  the  single  scattering  boundary  conditions.  Since  the  energy- 
conserving  PE  a  priori  assumes  small  departures  from  range  independence,  it  correctly  anticipates  the  no- 
backscatter  result,  and  consequently  closely  reflects  the  physics  of  the  problem.  As  a  result,  its  numerical 
implementation  is  more  efficient  and  robust. 

The  thought  experiment  above  also  assures  us  that  a  better  energy-conserving  PE  must  exist.  Once 
again,  consider  the  tilted  interface  without  curvature.  Since  there  is  no  backscatter,  it  should  be  possible  to 
construct  a  downrange  stepping  algorithm  that  generates  a  field  that  can  be  made  arbitrarily  close  to  the 
full-wave  solution.  In  other  words,  there  is  nothing  inherently  special  about  a  variable  density  that  would 
preclude  the  existence  of  a  systematic  expansion  of  the  field  equation.  An  ad  hoc  procedure  that  reaches  a 
limit  beyond  which  improvement  is  impossible  cannot  be  the  final  word. 

Now  that  we  have  a  better  idea  of  what  we  are  looking  for,  we  can  proceed  to  find  it.  Collins’  and 
Evans’  decision  (in  Ref.  40)  to  construct  a  new  formalism  directly  from  the  underlying  field  equation  was 
a  sound  one.  In  fact,  we  have  just  seen  that  the  basic  approach  they  specifically  used  to  describe  a 
crenellated  interface  could  in  principle  be  applied  to  any  interface  to  obtain  an  answer  that  is  accurate  to 
any  given  precision.  However,  such  an  approach  turns  out  to  be  poorly  suited  for  the  broad  class  of 
scattering  problems  where  the  incident  grazing  angle  and  interface  slope  are  modest.  On  the  other  hand, 
the  approach  developed  in  this  report  begins  with  the  same  stalling  point,  but  then  it  employs  a  strategy 
more  closely  attuned  to  the  physics  and  the  modeling  requirements  of  the  problem:  begin  with  the  original 
field  equation,  and  then  perform  a  series  of  canonical  transformations  to  systematically  generate  the  PE. 
(These  transformations  are  known  as  FW  transformations,  and  they  are  discussed  in  Section  3.1.)  In  this 
way,  an  energy-conserving  PE  that  is  accurate  to  any  given  precision  is  constructed. 

2.3.2.2  The  Practical  Utility  of  the  New  Energy-Conserving  PE 

There  is  a  very  real  need  for  such  a  robust  energy-conserving  PE,  particularly  if  it  also  turns  out  to  be 
efficient  when  applied  to  the  penetrable  rough  interface.  The  general  utility  of  the  PE  is  underscored  by 
the  fact  that  in  the  past,  as  soon  as  the  technique  was  adapted  to  a  new  context,  it  rapidly  gained  wide 
acceptance.  The  examples  that  are  particularly  relevant  to  our  discussion  here  concern  the  application  of 
PE  techniques  to  model  rough  surface  scattering  from  impenetrable  rough  surfaces.  Particularly  useful 
results  were  obtained  when  various  techniques  were  employed  to  introduce  the  Dirichlet  (air-sea 


11  Similarly,  one  could  impose  one-way  stepping  on  the  full  Helmholtz  equation  (e.g.,  see  J.A.  DeSanto,  J.S.  Perkins, 
and  R.N.  Baer,  “A  Correction  to  the  PE,”  J.  Acoust.  Soc.  Am.  64,  1664-1666,  1978).  In  this  case,  one  would  have  to 
start  with  the  field  specified  on  two  range  grid  points  rather  than  one  and  subsequently  use  the  second  downrange 
derivative  to  generate  downrange  stepping.  Once  again,  in  the  absence  of  energy  conservation,  such  a  procedure 
runs  into  problems  with  numerical  stability.  These  can  be  solved  by  artificially  enforcing  energy  conservation,  but 
this  introduces  ad  hoc  changes  to  the  problem  with  unknown  consequences.  To  consider  a  final  permutation  of  the 
examples  in  this  footnote  and  in  the  paragraph  that  references  it,  one  could  of  course  use  the  ordinary  PE  to  step 
downrange  and  impose  the  single-scatter  boundary  conditions  for  a  truly  tilted  (i.e.,  not  stepped)  interface.  Again, 
this  will  lead  to  stability  problems  associated  with  the  absence  of  a  stabilizing  conservation  law. 
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interface)  and  sometimes  also  the  Neumann  (ideal  infinitely  hard  bottom)  boundary  conditions  directly 
into  the  stepping  algorithm,  which  was  then  used  to  propagate  a  field  adjacent  to  the  rough  impenetrable 
interface  [31,  37-38,  41-46],  A  hybrid  Green’s  function/PE  technique  for  calculating  acoustic  scattering 
from  rough  impenetrable  boundaries  is  also  available  [47-52],  The  PE  technique  has  even  been 
successfully  applied  to  obtain  good  models  of  electromagnetic  scattering  from  rough  interfaces  bordering 
conducting  materials  [53-55],  In  this  case,  impedance  boundary  conditions0  were  imposed  along  the 
interface  with  the  conducting  surface.  Strikingly,  there  has  been  virtually  no  analogous  work  employing 
the  PE  to  model  the  general  problem  of  scattering  from  and  propagation  through  realistic  rough 
penetrable  surfaces  (as  opposed  to  artificially  smoothed  ones  or  crenellations).  This  is  significant  in  light 
of  the  facts  that,  as  we  have  seen,  the  problem  is  relevant  to  the  important  issue  of  roughness-induced 
penetration  into  the  ocean  bottom  [26-30],  and  that  the  PE  is  generally  the  preferred  method  for  modeling 
underwater  acoustics. 

We  are  now  in  a  good  position  to  better  understand  why  the  modeling  community  has  been  so 
reluctant  to  use  the  current  state  of  the  art  PE  formalism  for  the  penetrable  interface  (the  Stair  Step  PE)  to 
model  the  rough  interface  problem.  The  key  difference  between  this  formalism  and  those  for  impenetrable 
interfaces  is  that  in  the  latter  case,  a  flat  (range-independent)  interface  can  easily  be  distorted  into  a  rough 
one,  while  in  the  former  case  it  cannot.  This  is  significant  because  the  techniques  that  allow  us  to  adapt 
the  PE  to  rough  impenetrable  interfaces  ultimately  rely  on  conformal  mappings,  a  local  method  of  images, 
or  some  similar  distortion  of  the  range-independent  problem.  On  the  other  hand,  as  alluded  to  in  Section 
2.2  and  as  will  be  further  elaborated  in  Section  6.2.2,  the  current  state  of  the  art  theory  for  a  flat 
penetrable  interface  cannot  be  distorted  in  this  way.  The  stair  step  method  treats  the  vertical  interface  in  a 
way  that  is  fundamentally  different  from  the  way  it  treats  the  horizontal  interface  —  hence  the 
superfluous  Gibbs  oscillations  that  then  need  to  be  eliminated  by  the  ad  hoc  introduction  of  evanescent 
waves.  This  procedure  is  so  unwieldy  and  therefore  so  ill  suited  to  the  rough  surface  scattering  problem 
precisely  because  surface  roughness  cannot  be  imposed  as  a  straightforward  distortion  of  the  range- 
independent  problem.  Seen  in  this  light,  the  objective  here  is  to  develop  a  PE  for  the  penetrable  interface 
such  that  a  flat  surface  can  naturally  be  distorted  into  a  rough  one.  This  is  a  basic  property  of  any  good 
field  theory,  and  we  can  therefore  expect  it  to  emerge  from  a  PE  that  is  produced  by  a  systematic 
expansion  of  the  field  equation. 

2.3.2.3  A  Concrete  Strategy  for  Replacing  the  Current  Techniques 

Such  a  PE  will  be  developed  below  by  adapting  and  extending  a  technique  that  was  developed 
previously  for  a  similar  problem.  Underlying  this  approach  is  the  recognition  (alluded  to  above  in  Section 
2.1)  that  the  Schrodinger  equation  is  the  PE  that  corresponds  to  the  Klein-Gordon  equation  of  relativistic 
quantum  mechanics.  Once  the  relationship  between  the  quantum  and  acoustic  problems  is  recognized, 
then  the  method  used  to  systematically  derive  the  Schrodinger  equation  from  the  Klein-Gordon  equation 
can  be  applied  to  acoustics.  (This  method  is  constructed  around  the  FW  transformation  mentioned  a  few 
paragraphs  above.)  Then  the  recognition  that  the  semi-classical  (i.e.,  non-relativistic)  theory  of  the  atomic 
Lamb  shift  essentially  involves  a  rough-surface  scattering  problem  will  be  leveraged  to  establish  the 
validity  of  the  PE  based  on  our  technique  to  the  acoustic  rough  surface  scattering  problem. 


0  The  impedance  boundary  conditions  involve  constants  of  proportionality  that  are  imposed  a  priori  and  do  not 
directly  depend  on  the  value  of  the  field.  They  are  not  to  be  confused  with  impedance  matching  conditions  of  the 
sort  found,  for  example,  at  two-fluid  interfaces.  The  PEs  for  the  Neumann  and  Dirichlet  boundary  conditions 
typically  conserve  energy,  but  these  impedance  conditions  may  violate  this  conservation  rule.  As  noted  earlier, 
energy  conservation  is  a  desirable  attribute  for  a  PE  since  it  helps  ensure  numerical  stability. 
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3.  THE  FORMAL  APPROACH 

Section  3  establishes  the  basic  formal  approach  that  will  be  used  to  construct  a  new  PE  for  classical 
fields  (initially  the  acoustic  field,  but  in  Section  4.3  the  approach  will  also  be  applied  to  two  classical 
vector  fields).  At  the  core  of  this  approach  is  the  FW  transformation.  The  FW  transformation  was 
developed  in  the  late  1940s  for  use  in  quantum  mechanics  [56],  and  it  has  recently  been  applied  to  the 
acoustic  problem  [1],  The  latter  treatment  is  based  on  Chapter  4  and  Section  9.7  of  the  text  by  Bjorken 
and  Drell  [57].  Section  3.1  provides  an  overview  of  the  resulting  formal  structure,  while  the  rest  of 
Section  3  begins  to  probe  the  behavior  of  this  formalism  near  range-dependent  discontinuities  of  the 
environmental  parameters  or  their  derivatives.  (As  discussed  in  Section  2.2,  such  singularities  do  in  fact 
occur  in  a  realistic  ocean  environment.)  In  Section  3.2,  we  examine  the  effects  of  range  dependence,  and 
then  concentrate  on  the  way  that  a  cusp  in  the  sound  speed  profile  (i.e.,  a  discontinuity  of  the  sound  speed 
gradient)  induces  a  classical  Lamb  shift  when  it  is  advected  by  stochastic  range-dependent  fluctuations. 
We  also  examine  a  “toy  model”  of  the  atomic  Lamb  shift,  and  thus  provide  a  justification  for  our 
identification  of  the  acoustic  field  effect  with  the  atomic  Lamb  shift.  Finally  in  Section  3.3,  we  will  move 
from  the  stochastically  advected  cusp  to  the  numerical  evaluation  (in  a  discretized  space)  of  an  acoustic 
field  propagating  through  deterministic  interfaces  that  have  slope  and  curvature.  This  will  prepare  us  for 
our  consideration  in  Section  4  of  the  classical  Lamb  shift  associated  with  a  rough  interface  where  the 
environmental  parameters  themselves  jump,  and  then  in  Section  6  of  the  main  result  of  this  work:  a  PE 
that  incorporates  Bragg-scale  vorticity  (i.e.,  a  jump  in  the  flux  transverse  to  an  interface  where  the  density 
jumps).  Thus,  the  results  of  Section  3  are  in  Section  4,  and  the  understanding  gained  in  these  two  sections 
will  be  used  to  mount  a  broad-based  assault  on  the  core  problem  associated  with  a  rough  interface  where 
the  density  jumps. 

3.1  The  Foldy-Wouthuysen  Transformation 

The  FW  transformation  is  a  canonical  transformation  that  can  be  used  to  convert  the  full-wave 
equation  into  a  PE.  Here  in  Section  3.1,  the  related  procedure  is  introduced  and  formally  applied  to  the 
basic  (i.e.,  fixed  density)  acoustic  problem,  and  later  (in  Section  7)  the  technique  will  be  used  to  develop 
PEs  for  other  classical  fields  as  well. 

3.1.1  The  Conceptual  Framework 

This  subtle  procedure  is  best  introduced  by  briefly  reviewing  its  origins,  and  then  examining  its  broad 
outlines. 

3.1. 1.1  Exploiting  Similarities  Between  Quantum  Mechanics  and  Acoustics 

The  well-developed  formalism  from  the  quantum  mechanical  description  of  hydrogen’s  atomic 
spectrum  will  be  exploited  to  derive  a  new  acoustic  PE.  This  effort  is  built  upon  the  well-known 
observation  that  the  PE  has  the  same  form  as  the  Schrodinger  equation  of  quantum  mechanics  (e.g.,  see 
Refs.  15  (pp.  282-283),  58  (pp.  286-289)),  and,  consequently,  that  it  approximates  the  Helmholtz  equation 
for  an  acoustic  field  in  much  the  same  way  that  the  non-relativistic  Schrodinger  equation  approximates 
the  relativistic  Klein-Gordon  equation13.  The  basis  of  the  analogy  is  summarized  in  Table  1.  Time 
development  of  the  quantum  mechanical  field  corresponds  to  down-range  propagation  of  the  acoustic 
field.  The  solutions  to  the  forward-  propagating  PE  of  the  acoustic  problem  correspond  to  the  components 
of  the  quantum  field,  which  propagate  forward  in  time.  The  latter  are  commonly  referred  to  as  “particles” 
in  contrast  to  “antiparticles,”  which,  mathematically  speaking,  propagate  backwards  in  time  and  are 


p  Note  that  the  Klein-Gordon  equation  describes  bosons  (such  as  mesons)  and  not  fermions  (such  as  the  electron  and 
proton  in  the  hydrogen  atom),  so  the  direct  analogy  being  drawn  is  between  a  classical  scalar  field  and  a  mesonic 
atom,  and  not  exactly  with  the  hydrogen  atom. 
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Table  1  -  The  nature  of  the  analogy  between  the  Schrodinger  equation  for  a 
relativistic  quantum  wave  function  obeying  the  Klein-Gordon  equation  and  the 
PE  for  a  scalar  classical  field  obeying  the  Helmholtz  equation.  (For  a  more 
detailed  discussion  of  the  nonrelativistic  limit  and  the  shallow  grazing  angle 
condition,  see  Appendix  C2.3.1.) 


QUANTUM  FIELD 

CLASSICAL  LIELD 

Klein-Gordon  equation 

Helmholtz  equation  ( k0  =  m ) 

Schrodinger  equation 

Parabolic  equation  (PE) 

time 

range 

rest  mass  m 

reference  wave  number  k0 

4-dimensional  space-time 

3-dimensional  space 

Minkowski  metric  [1,-1, -1,-1] 

Euclidean  metric  [1,1,1] 

non-relativistic  limit 

shallow  grazing  angle 

antiparticles 

backscatter 

analogous  to  the  backscattered  acoustic  wave.  In  this  report,  we  complete  the  analogy  by  introducing  a 
scalar  time -dependent  external  potential  into  the  quantum  mechanical  problem.  This  is  the  direct  analog 
to  range  dependence  of  the  environment  in  the  acoustic  problem,  but  as  discussed  below,  it  is  really  a  “toy 
model”  and  not  an  accurate  physical  description  of  an  atom  made  up  of  bosons  (e.g.,  a  mesonic  atom). 

A  number  of  important  insights  emerge  from  the  analogy  between  quantum  mechanics  and  acoustics. 
Most  significantly,  quantum  field  theory  suggests  a  systematic  technique  for  generating  PEs.  The 
Schrodinger  equation,  complete  with  relativistic  corrections,  can  be  derived  using  the  FW  transformation 
[56,  57,  59].  This  transformation  provides  an  order-by-order  prescription  for  decoupling  the  forward-  and 
backward-propagating  solutions  of  the  Klein-Gordon  equation.  The  iterative  procedure  superficially 
resembles  renormalization  in  the  narrow  sense  that,  with  each  iteration,  terms  with  an  undesirable 
characteristic  (in  this  case  coupling  between  forward  and  backward  propagation)  are  shifted  to  higher  and 
higher  order. 

3.1. 1.2  A  General  Overview  of  the  Foldy-Wouthuysen  Procedure 

The  FW  procedure  consists  of  several  steps.  In  the  ansatz,  a  second-order  differential  equation  is 
converted  to  a  first-order  equation  by  adding  a  degree  of  freedom.  A  vector  is  formed,  and  its  components 
are  linear  combinations  of  the  pressure  field  and  its  first  derivative.  The  linear  combinations  are  chosen  so 
that  the  two  components  of  the  vector  are  carriers  of  the  downrange  and  uprange  flux.  This  two- 
dimensional  vector  obeys  a  PE  where  the  Hamiltonian  is  a  2x2  matrix.  A  series  of  canonical 
transformations  are  employed  to  uncouple  the  fluxes  order  by  order  in  the  PE  expansion  parameter(s).  In 
other  words,  the  off-diagonal  terms  in  the  matrix  Hamiltonian  H  are  removed,  and  each  flux  now 
propagates  independently  using  a  scalar  PE  (containing  a  scalar  Hamiltonian  H).  The  now  uncoupled 
fluxes  differ  from  the  field  by  an  operator  that  corresponds  to  a  WKB  amplitude.  Typically,  the  scalar  PE 
is  used  to  propagate  the  carrier  of  the  downrange  flux,  while  the  uprange  flux  is  zero.  The  WKB 
amplitude  is  applied  at  the  endpoint  of  the  downrange  propagation  to  recover  the  acoustic  pressure  field. 
When  the  range  dependence  is  weak,  the  WKB  amplitude  involves  a  near-eigenoperator,  and  the 
transformation  from  the  pressure  field  to  the  carrier  of  flux  (before  propagation)  and  the  transformation 
back  from  the  carrier  of  flux  to  the  pressure  field  (after  propagation)  nearly  cancel.  Under  these 
circumstances,  the  distinction  between  propagating  the  auxiliary  field  and  the  actual  acoustic  field  can  be 
ignored.  Finally,  we  note  that  the  PE  derived  using  the  FW  transformation  contains  unique  new  terms. 
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Thus,  the  PE  of  motion  propagates  an  auxiliary  field  %,  which  is  related  to  the  pressure  field  A,  but 
not  identical  to  it.  To  further  familiarize  ourselves  with  the  fundamental  attributes  of  the  procedure,  let  us 
fix  the  density  and  review  the  treatment  given  in  Ref.  1.  The  details  from  Ref.  1  are  reproduced  in 
Appendix  C  for  completeness.  (Since  we  are  not  yet  explicitly  introducing  interfaces,  the  rest  of  this 
subsection  will  apply  to  both  two-dimensional  x-z  and  three-dimensional  x-y-z  spaces.) 

3.1.2  Implementing  the  Procedure 

Now  the  broad  outline  presented  above  is  converted  into  a  specific  procedure. 

3.1.2.1  The  Ansatz  for  Acoustics 

Given  the  Helmholtz  equation 


d2A 

dx2 


+  V"A  +  k~n~  A  —  0 , 


the  FW  ansatz  becomes 


o  = 


Kx) 


f  „  i  dA  '2 
k,  dx 


A- 


dA 


V  K  dx; 


O  is  a  two-dimensional  vector.  Taking  the  inner  product  of  O  with  itself  under  the  metric  O 

fl  0^ 


*1  = 


0  -1 


we  have 


|2  1  . 


®n<&=\0\ -I  x\=—  Im 
k 


A- A 

dx 


o c  Sx=  uprange/downrange  flux  . 


(6) 


(7) 


(8) 


Thus,  0  and  %  are  indeed  carriers  of  the  uprange  and  downrange  flux,  respectively. 

3.1.2.2  Using  Canonical  Transformations  to  Decouple  Uprange  and  Downrange  Solutions 

As  shown  in  Appendix  C.1.1  (and  in  Section  I.A  of  Ref.  1),  the  definitions  in  Eq.  (7)  combined  with 
the  Helmholtz  equation  (Eq.  (6))  generate  the  state  space  equation: 

90 

i  —  =  7fO, 

dx 


where 


H  =  0  +  S  +  kji. 


(9) 


i]  is  defined  in  Eq.  (8)  above,  and  the  odd  operator  O  and  the  even  operator  £  are  given  by 


0  =  A4 

£  =  Arj  ’ 


(10) 
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A  is  as  in  Eq.  (5)  and  £  is  a  2x2  matrix  that  is  “odd”  in  the  sense  that  it  anti -commutes  with  /; : 

f  0  O 


-1  o 


(ii) 


Note  that  lj  is  “even”  in  the  sense  that  it  anti-commutes  with  itself. 

As  described  in  Refs.  1  and  57  and  in  Appendix  C.2.3.2,  the  FW  transformation  consists  of  a 
transformation 

0  = 


(0/ 

— >  0  = 

vX 

such  that 


O  =  e,s0  with  S  = 


irjO 

2k 


(12) 


and  the  transformation  77  —>  77 

77  =  e'sTLe~'s  +  [terms  generated  by  the  range  dependence] .  (13) 

This  transformation  is  canonical  in  the  sense  that  the  basic  form  of  the  equation  is  unchanged: 

90 

i —  =  770. 
dx 

It  is  iterative  in  that  it  is  used  over  and  over  again  to  eliminate  odd  terms  that  couple  uprange  and 
downrange  propagation.  The  order  of  the  remaining  odd  terms  increases  with  each  iteration  of  the 
procedure. 

The  Hamiltonian  diagonalized  (denoted  by  a  tilde)  to  fourth  order  (denoted  by  the  Roman  numeral 
IV  )  is  given  by 

H‘v=k0ii  +  £",  (14) 

where 


n 


o2  o4 


gii  _  V  o 

1 


2  kn  8  k. 


+  £-A[0,[0,£]]  '  [0,0] 


o ) 


8  k, 


8  k- 


(15) 


+  ^j(-[0,£]2 -i{6,[0,£]}  +  02))+  5th  order 

8^o  ' 


As  always,  the  dot  denotes  the  downrange  derivative  d/dx.  Orders  are  denoted  by  powers  of  the 
original  odd  and  even  operators  O  and  £ ,  as  well  as  powers  of  the  downrange  derivative  d/dx .  (The 
commutator  [,]  and  anti-commutator  {,}  are  defined  in  footnote  f.)  Equation  (15)  is  a  result  taken  from 
the  treatment  of  the  FW  transformation  in  Ref.  57  extended  up  an  order  in  Ref.  1.  The  calculation  is 
reproduced  in  Appendix  C.2.4,  where  Eqs.  (14)  and  (15)  are  simply  Eq.  (C.28)  and  (C.29). 

Equation  (15)  is  quite  general.  It  is  a  direct  consequence  of  the  fact  that  the  initial  odd  operator  O  is  a 
sum  of  operators'1  multiplied  by  £,  or  )]q ,  and  the  initial  even  operator  £  is  a  linear  combination  of 


q  These  operators  are  Hermitian  or  anti-Hermitian  depending  of  whether  they  are  coefficients  to  </  (which  is  anti- 
Hermitian);  or  of  either  ijc; ,  ij  or  1  (which  are  all  Hermitian);  and  whether  the  overall  Hamiltonian  77  is 
Hermitian  or  pseudo-Hermitian. 
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operatorsq  multiplied  by  tj  or  1 .  Since  these  four  matrices  span  the  space  of  2  X  2  matrices,  it  is  always 
possible  to  break  a  given  2x2  matrix  Hamiltonian  TL  into  such  O  and  8  operators.  As  discussed  at  the 
end  of  Appendix  C.2.3.2,  the  result  shown  in  Eq.  (15)  depends  on  the  fact  that  ±rj ,  ±% ,  ±11%  ,  and  ±1 
(the  2x2  matrices  that  operate  on  state  space  vectors)  are  closed  under  multiplication.  (In  fact,  they  form 
a  group.  Again,  this  is  no  surprise  since  the  four  matrices  span  the  space  of  all  2x2  matrices).  This 
means  that  at  all  iterations  of  the  procedure,  the  odd  operator  O  continues  to  be  a  sum  of  operators1* 
multiplied  by  %  or  TJ% ,  and  the  even  operator  £  continues  to  be  a  sum  of  operators'1  multiplied  by  //  or 
1 .  The  matrices  %  and  /]%  are  called  odd,  because  they  anti-commute  with  // ,  and  the  matrices  //  and  1 
are  called  even  because  they  commute  with  // .  The  anti-commutation  relation  implies  that  {Oj]\  =  0 . 

This  condition,  along  with  the  basic  form  of  the  definition  of  S  given  in  Eq.  (12)  and  the  basic  form  of  the 
Hamiltonian  given  in  Eq.  (9),  are  all  that  is  needed  so  that  the  FW  transformation  works  properly. 

Thus,  the  procedure  developed  by  Foldy  and  Wouthuysen  applies  whenever  the  field  equation  can  be 
written  in  the  form  of  the  state  space  equation  (Eq.  (9)),  where  in  some  sense  the  initial  odd  operator  O 
is  a  linear  combination  of  %  and  1]%  ,  and  the  corresponding  even  operator  £  is  a  linear  combination  of 
7/  and  1 .  It  turns  out  that  this  will  trivially  be  the  case  when  we  consider  the  variable -density  acoustic 

equation  (here,  Tt  is  still  a  basic  2x2  matrix),  and  it  will  also  hold  in  some  broader  sense  when  we 
consider  vector  fields  (e.g.,  electromagnetic  and  elastodynamic  fields).  For  the  vector  fields,  the  situation 
will  be  similar  to  that  in  the  quantum  mechanical  problem.  In  the  quantum  problem,  O  is  constructed  of 
matrices  that  are  outer  products  of  the  form  <7j  0  (f]%)  ( <7,  are  Pauli  spin  matrices),  and  both  £  and  the 

term  corresponding  to  k0rj  are  proportional  to  107/.  The  closed  multiplicative  structure  and 
commutation  rules  needed  for  the  FW  transformation  are  again  generated  by  the  group  of  2  X  2  matrices 
made  up  of  % ,  1]% ,  7/ ,  1 ,  and  their  negatives.  As  the  FW  procedure  is  implemented,  these  will  cycle 
through  in  the  right-hand  slot  of  the  outer  product.  (As  discussed  in  Appendix  C.2.3.1,  the  quantum 
mechanical  problem  also  follows  a  slightly  different  set  of  power-counting  rules,  and  as  we  see  at  various 
points  throughout  the  treatment  below,  it  subtly  differs  in  various  other  respects.)  We  see  in  Section  7  that 
the  Hamiltonians  7 i,  for  electromagnetic  and  elastodynamic  fields  are  also  constructed  from  matrices  that 
involve  outer  products  between  submatrices  related  to  the  detailed  properties  of  the  field,  and  the  usual 
group  of  2  X  2  matrices  associated  with  state  space. 

3.1.2.3  The  Decoupled  State  Space  Equation  for  Acoustics 

Substituting  for  the  values  of  O  and  £  specific  to  the  acoustic  constant-density  problem  and 
performing  two  extra  FW  transformations  to  generate  manifest  range -reversal  symmetry  (denoted  by  the 
superscript  2),  this  leads  to 


nN2  =  nK 


i+ 


22 _ 2_  2- 

r  8F  4^ 


-  + 


3{2,2} 


1 6k, ; 


+  5  th  order 


(16) 


This  is  Eq.  (51b)  of  Ref.  1  and  Eq.  (C.345)  in  Appendix  C.2.5.  Here,  orders  are  obtained  by  counting  the 
powers  of  2  ,  and  the  number  of  times  the  downrange  derivative  d/dx  operates  on  a  2  .  For  example,  2 

is  third  order,  and  2“  is  fourth  order.  The  significance  of  range-reversal  symmetry  is  discussed  briefly  in 
Section  3. 1.3. 3  and  Appendix  C.2.5. 
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Defining  the  scalar  Hamiltonian  H  : 

H  =  k, 
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8  k, 
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(where  the  square  root  operator  must  be  expanded  to  finite  order  in  2 A/ kfj ),  we  have  the  result  77  =  r/H  . 
Note  that 


r)(j>  -  ~ 

i - =  770  where  0  = 

dx 


\Xj 


We  will  generally  be  interested  in  one-way  (downrange)  propagation  (i.e.,  0  =  0  ).  In  such  propagation, 
we  drop  the  tilde  on  % ,  and  from  now  on  j  ust  use  X  ■  Thus,  the  scalar  Hamiltonian  H  generates 
downrange  propagation  via  the  PE: 


~i  =  H X  ■  (18) 

ox 

We  notice  that  the  Hamiltonian  associated  with  this  PE  involves  the  familiar  expansion  of  the  square  root 
operator,  and  new  terms  that  involve  downrange  derivatives.  These  additional  terms  are  strictly  associated 
with  the  range  dependence. 

3.1.2.4  The  Auxiliary  Field  X 

Note  that  Eq.  (18),  the  PE  obtained  using  the  FW  transformation,  actually  propagates  an  auxiliary  field 
X  ■  In  Appendix  D.l,  we  show  that,  ignoring  the  range  dependence  locally  (but  not  necessarily  at  more 
distant  values  of  the  range  or  the  transverse  coordinates),  the  field  %  is  related  to  the  physical  pressure 
field  by  the  equation 


(19) 


In  other  words,  the  FW  transformation  indeed  recovers  the  result  so  that  the  PE  actually  propagates  an 
auxiliary  field  where  the  WKB  amplitude  has  been  removed  from  the  pressure  field. 

3.1.2.5  How  Energy  Conservation  is  Built  into  the  Formalism 

As  discussed  in  Ref.  1 ,  the  FW  procedure  is  designed  to  maintain  certain  conservation  laws.  In  Section 
LB  of  this  reference  (or  equivalently  Appendix  C.2.1  below),  it  is  demonstrated  that  conservation  of  the 
total  downrange  energy  flux  forces  the  matrix  Hamiltonian  77  originally  generated  by  the  FW  ansatz  to 
obey  a  property  called  pseudo-Hermiticity.  (Pseudo-Hermiticity  means  that  ( //'77 )  =  //77 ,  where  //  is, 
as  always,  the  matrix  defined  in  Eq.  (8)  above  and  the  dagger  1  denotes  Hermitian  conjugation.)  As 
shown  in  Section  II. A  of  Ref.  1  (or  equivalently  in  Appendix  C.2.1  below),  the  FW  transformation  is 
designed  to  be  pseudo-unitary  so  that  pseudo-Hermiticity  of  the  matrix  Hamiltonian  77  is  maintained  as 
it  is  diagonalized  by  the  FW  procedure.  (Some  of  the  formal  issues  discussed  just  above  are  also  briefly 
revisited  at  the  beginning  of  Appendix  I  below.)  Pseudo-Hermiticity  of  the  diagnonalized  matrix 
Hamiltonian  in  turn  forces  the  scalar  Hamiltonian  H  defined  in  Eq.  (17)  to  be  Hermitian  (i.e.,  H  =  H 
since  the  diagonalized  matrix  Hamiltonian  is  r/H  and  rj~  =  1 ).  The  Hermiticity  of  H  is  also  confirmed 
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by  direct  inspection  of  Eq.  (17)  (provided,  of  course,  that  the  square  root  is  understood  to  be  a  placeholder 
for  its  finite  series  expansion).  If  we  take  a  range  derivative  of  the  integral  over  transverse  space  of  the 
magnitude  squared  of  %  (i.e.,  J  dR,  \%\  ),  use  the  parabolic  Eq.  (18)  to  replace  downrange  derivatives  of 
X  with  the  Hamiltonian  operating  on  X  -  and  apply  Hermiticity  of  the  Hamiltonian  H,  it  follows  that 

dRr  \x\  is  conserved  during  downrange  propagation.  The  line  of  reasoning  just  completed  ties  the 
J  I  |2 

conservation  rule  for  jx  to  the  conservation  of  the  downrange  flux,  and  so  to  the  conservation  of  energy. 
As  discussed  in  Appendix  A,  the  connection  between  the  auxiliary  field  X  and  the  energy  flux  serves  to 
illuminate  the  relationship  between  X  and  the  pressure  field  A  as  given  by  Eq.  (19). 

An  important  related  point  is  also  discussed  in  Appendix  A.  If  there  is  no  range  dependence,  there  is 
no  mode  coupling,  and  the  endpoint  transformations  from  A  to  X  at  the  beginning  and  X  back  to  A  at 
the  end  precisely  cancel.  If  the  coupling  is  weak  in  the  sense  that  most  of  the  coupling  is  between  nearby 
eigenfunctions — nearby  means  the  eigenvalues  are  close,  the  endpoint  transformations  nearly  cancel.  In 
such  typical  cases,  reasonably  accurate  answers  can  be  obtained  using  the  PE  to  directly  propagate  A. 
Although  this  is  often  done,  it  is  good  practice  to  keep  in  mind  that  this  is  an  approximation,  and  that  the 
PE  really  propagates  an  auxiliary  field  X  ■ 

3.1.3  Comments  cmd  Implications 

Recall  that  in  this  section  interfaces  were  not  explicitly  introduced,  and  so  the  discussion  here  applies 
to  both  three-dimensional  and  two-dimensional  spaces.  In  Section  3.3.3,  we  return  to  the  interface 
problem,  and  once  again  restrict  ourselves  to  two-dimensional  space  and  one-dimensional  interfaces. 

Thus,  here  in  Section  3.1,  the  FW  transformation  has  been  borrowed  from  quantum  mechanics  and 
applied  to  derive  the  basic  acoustics  PE  (i.e.,  constant  density,  but  the  sound  speed  can  vary).  The 
treatment  in  this  section  is  similar  to  the  derivation  of  the  Schrodinger  equation  from  the  Klein-Gordon 
equation  given  in  the  classic  relativistic  quantum  mechanics  text  by  Bjoken  and  Drell  (Ref.  57;  Section 
9.7).  Two  aspects  of  the  problem  known  previously  to  the  acoustic  community  from  other  considerations 
have  been  brought  into  the  FW  formulation  for  the  first  time.  These  are  the  explicit  relationship  between 
the  PE  field  and  the  full-wave  field,  and  the  introduction  of  a  true  scalar  potential  into  the  PE  formalism. 
New  ideas  also  flowed  in  the  other  direction  as  the  FW  transformation  for  the  first  time  introduced  into 
the  acoustic  PE  terms,  which  are  related  to  range  dependence,  and  which,  as  we  will  see  in  the  next 
section,  generate  the  classical  equivalent  of  an  often-overlooked  component  of  the  atomic  Lamb  shift 
(known  as  “vacuum  polarization”).  Finally,  an  entirely  new  innovation  to  the  FW  procedure  was 
introduced  as  an  extra  FW  transformation  was  performed  to  make  the  scalar  Hamiltonian  H  (and  so 

77  =  VjH  )  manifestly  range -reciprocal.  Let  us  examine  these  features  more  closely. 

3. 1.3.1  Insights  Brought  from  Acoustics  into  the  Foldy-Wouthuysen  Procedure 

The  first  insight  brought  from  acoustics  into  the  FW  formal  structure  is  the  relationship  between  the 
auxiliary  field  X  propagated  by  the  PE  and  the  acoustic  pressure  field  A  propagated  by  the  Helmholtz 
equation.  As  shown  in  Eq.  (19),  the  two  fields  differ  by  a  quarter-power  operator  sometimes  known  as  a 
WKB  amplitude  (because  the  same  operator  shows  up  as  an  amplitude  term  in  the  WKB  approximation). 
This  was  never  a  consideration  in  the  quantum  problem  as  described  in  Ref.  57,  Section  9.7,  because  in 
that  problem  one  is  only  concerned  with  expectation  values.  These  are  integrals  over  transverse  (in  this 
case,  three-dimensional  physical)  space  of  the  inner  product  O/7O,  where  O  is  as  always  the  field  in 
state  space.  The  operator  transforming  the  original  field  O  to  O ,  the  field  propagated  by  the  diagonal 
Hamiltonian,  is  pseudo-unitary,  and  so  the  transverse  integral  of  the  inner  product  O  T/Ois  the  same  as 
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that  of  O  /7O,  and  so  it  automatically  yields  the  desired  expectation  value,  and  there  is  no  need  to 
directly  calculate  the  original  field  O .  In  the  acoustic  problem,  on  the  other  hand,  we  can  observe  and 
therefore  need  to  recover  the  actual  acoustic  field  A,  and  so  in  this  case  we  need  to  be  able  to  explicitly 
transform  back  and  forth  between  the  PE  field  %  and  A. 

The  second  innovation  brought  from  the  acoustic  formalism  into  the  FW  formal  structure  is  the 
concept  of  the  true  scalar  potential.  The  acoustic  scalar  potential  was  introduced  directly  into  the 
Helmholtz  equation  (as  a  spatial  dependence  on  the  sound  speed).  It  shows  up  in  the  odd  operator 
O  =  AE,  ,  since  the  operator  A  incorporates  both  a  “kinetic  energy”  part  Vr  •  VT/2kQ  and  a  potential 

— k0/J .  On  the  other  hand,  for  a  quantum  field  obeying  the  Klein-Gordon  equation  (e.g.,  a  pionic  atom), 
the  operator  corresponding  to  A  only  includes  a  kinetic  energy  term  n  ■  ft /2m  ,  where  ft  is  a  generalized 

momentum  7t  =  p  —  A  (here  A  is  a  vector  potential).  In  Ref.  57  (Section  9.7),  the  scalar  potential  is 
introduced  directly  into  the  state  space  equation  as  a  scalar  times  the  unit  matrix.  Since  it  has  intrinsic 
matrix  properties,  it  only  shows  up  in  the  even  operator  £  ,  but  not  in  the  odd  operator  O.  The  true  scalar 
potential  is  therefore  new  to  the  blend  of  the  FW  approach  and  acoustics. 

3. 1.3.2  Insight  that  the  Foldy-Wouthuysen  Procedure  Brings  to  Acoustics 

The  approach  based  on  the  FW  transformation  introduces  something  new  into  the  basic  acoustic 
PE.  These  are  new  terms  that  are  generated  by  range  dependence  -  for  example,  the  term  — A/Sk q  in  Eq. 

(17).  We  will  examine  these  in  detail  shortly  and  discover  that  these  terms  are  connected  to  the  classical 
equivalent  of  the  atomic  Eamb  shift. 

3.1.3.3  What  is  Completely  New  in  the  Approach  Developed  Here? 

Finally,  there  is  something  entirely  new  to  the  treatment  of  the  FW  transformation  outlined  above. 
Following  a  suggestion  by  Dashen,  an  extra  FW  transformation  is  used  so  that  the  new  range -dependent 
terms  introduced  by  the  basic  FW  transformation  exhibit  range-reversal  symmetry  (i.e., 
7 ~C  (— x)  =  7 i  (v) ).  This  is  a  symmetry  exhibited  by  the  Helmholtz  equation,  and  it  is  therefore  expected 
that  an  optimal  parabolic  approximation  to  it  obeys  it  as  well.  In  the  deterministic  problem,  the  term 
—  A/&ky,  for  example,  replaces  the  term  Ak2^A,Aj.  In  the  stochastic  problem,  this  change  will 

effectively  move  incoherent  terms  like  S^j  and  ^(V7//)  V7c)^  to  the  purely  coherent  term 

(jj)(z)  ( 8x  's  ^e  incoherent  part  and  (z)  the  coherent  part  of  the  field:  X  =  {z)  +  $X )•  This  will  be 

very  important  so  that  we  will  be  able  to  correctly  recognize  the  Famb  shift  as  a  purely  coherent  effect. 
Thus,  the  range-reversal-invariant  formulation  transparently  leads  to  correct  physical  interpretations  that 
would  otherwise  be  obscured.  This  version  of  the  result  also  happens  to  be  easier  to  evaluate,  again 
because  it  exploits  a  key  symmetry  of  the  problem.  However,  we  need  to  note  that  the  Dirac  equation 
exhibits  a  symmetry  that  is  a  little  more  subtle  than  pure  time  reversal,  and  so  this  particular  operation  is 
not  appropriate  for  the  atomic  problem. 

3.1.3.4  Summing  up  the  Flow  of  Ideas 

An  accounting  of  what  has  been  brought  from  acoustics  into  the  FW  formalism,  what  has  come  from 
the  FW  formalism  into  acoustics,  and  what  is  unique  to  the  new  synthesis  is  given  in  Table  2.  From  here 
on,  the  material  is  generally  unique  to  the  effort  developed  here. 
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Table  2  -  This  table  summarizes  the  six  paragraphs  above.  It  catalogues  what  is  new  to  the 
approach  here,  what  has  come  from  the  existing  knowledge  base  in  acoustics,  and  what  has  come 
from  the  existing  formal  structure  developed  for  the  FW  transformation  in  the  context  of  quantum 
mechanics. 


The  LW 

Transformation  as 
used  for  Scalar 
Quantum  Lields 
(Bjoken-Drell  [57], 
Section  9.7) 

PE  Developed  by  the 
Acoustics  Community 
Using  Traditional 
Methods 

New  Treatment  of  the 
Acoustics  PE  Using 
the  FW 

Transformation 

Known  relationship 
between  PE  field 

X  and  full  wave  field 
( <p  or  A) 

No 

Yes 

Yes 

True  scalar  potential 

No 

Yes 

Yes 

Unique  range- 
dependent  terms 
(ultimately  connected 
to  a  new  type  of 

Lamb -shift  effect) 

Yes 

No 

Yes 

Extra  LW 
transformations  so 
that  the  new  terms 
exhibit  range/time- 
reversal  symmetry 

No 

No 

Yes 

3.2  The  Classical  Lamb  Shift  for  a  Cusp  and  a  “Toy  Model”  for  the  Atomic  Lamb  Shift 

3.2.1  The  Physical  Implications  of  a  Singularity 

When  there  is  a  jump  in  /l  or  in  the  derivative  of  LI ,  then  the  “new”  term  in  the  FW  transformation 

— =  jj/ftk^  generates  S'-  or  <!> -functions,  respectively.  The  appearance  of  such  J-function 

potentials  in  a  PE  has  interesting  precedents  in  the  Schrodinger  equation  for  the  hydrogen  atom.  These  so- 
called  contact  potentials  are  associated,  for  example,  with  the  fine  and  hyperfine  structure  of  the  spectrum 
of  the  atom.  One  of  these  contact  potentials  has  particular  relevance  to  the  present  discussion.  It  emerges 
when  the  FW  transformation  is  used  to  obtain  a  semirelativistic  picture  of  the  atomic  Lamb  shift  [60,  61]. 
In  these  calculations,  vacuum  fluctuations  are  imposed  by  hand  as  a  kind  of  incident  radiation,  or 
alternately  as  a  time  dependence  of  the  scalar  and/or  vector  potentials.  The  calculations  presented  in  Refs. 
60  and  61  are  sophisticated  extrapolations  of  Welton’s  qualitative  description  of  the  Lamb  shift  [62], 
which  is  nicely  described  in  Ref.  57. 


Lrom  Eq.  (4.5)  in  Ref.  61  and  the  discussion  that  follows,  we  learn  that  the  Lamb  shift  occurs  when  the 
Coulomb  (i.e.,  the  electrostatic)  potential,  which  is  singular  at  the  origin,  is  averaged  as  the  electron 
vibrates  under  the  influence  of  vacuum  fluctuations.  The  averaging  process  generates  a  distribution  (we 
also  need  to  make  use  of  the  general  result  V"  (l/r)  =  —4  7rS  (f)  J.  Cohn-Tanoudji  et  al.  note  elsewhere 
that  “the  average  potential  differs  from  the  value  of  the  potential  at  the  average  position  only  inside  the 
source  of  the  Coulomb  potential,  hence  the  function  8 ( r)  ”  [63].  Immediately  below,  they  note  “...the 
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Lamb  shift  is  a  quantum  effect,  just  like  the  vacuum  fluctuations  that  give  rise  to  it.  Nevertheless,  once 
the  existence  of  these  fluctuations  is  acknowledged,  their  effect  is  the  same  as  the  one  which  would  be 
produced  by  a  random  classical  field  characterized  by  a  spectral  density  corresponding  to  an  energy 
hco/ 2  per  mode”  [63], 

Significantly,  the  very  same  singularities  that  are  responsible  for  the  traditional  Lamb  shift  discussed 
in  the  previous  paragraph  also  cause  the  new  FW  terms  associated  with  vacuum  fluctuations  to  include 
distributions  (e.g.,  (^-functions).  These  (^-functions  (and  sometimes  also  other  distributions  such  as 
some  of  those  proportional  to  cT- functions)  are  immune  to  cancellations  during  averaging,  and  they 
provide  correction  terms  to  the  standard  Lamb  shift. 

Moving  to  the  classical  problem,  range  dependence  takes  the  place  of  time  dependence,  and  a 
stochastic  range  dependence  in  the  environmental  parameters  replaces  the  vacuum  fluctuations.  This 
analogy  exploits  the  fact  that  vacuum  fluctuations  can  be  interpreted  as  a  stochastic  time  dependence  of 
the  potentials  in  the  atomic  problem.  For  example,  we  can  imagine  vacuum  fluctuations  advecting  the 
nucleus  of  an  atom.  As  the  source  of  the  scalar  potential  (for  example)  moves  back  and  forth,  a  time 
dependence  of  this  potential  is  induced.  The  world  line  of  the  nucleus  (embedded  in  four-dimensional 
space-time)  forms  a  rough  surface  along  which  the  potential  is  singular.  Similarly,  in  the  classical 
problem,  a  quasiplanar  interface  along  which  an  environmental  parameter  or  its  gradient  is  singular  forms 
a  rough  surface  in  three-dimensional  space. 

Building  on  the  analogy  to  atomic  physics,  the  classical  Lamb  shift  occurs  when  the  second  derivative 
in  transverse  space  becomes  a  distribution.  This,  in  turn,  means  that  when  the  singularity  is  advected  by 
stochastic  fluctuations,  it  will  average  to  a  distribution  at  the  origin.  The  new  terms  from  the  FW 
transformation  involve  the  fluctuations  in  the  uprange/downrange  direction,  and  these  terms  are 
associated  with  a  comparatively  obscure  contribution  to  the  Lamb  shift.  The  better-known  and  larger  part 
of  the  Lamb  shift  effect  involves  fluctuations  in  transverse  space.  The  semiclassical  theories  that  deal  with 
the  Lamb  shift  often  involve  only  the  latter.  We  consider  both  below,  but  concentrate  particularly  on  the 
part  associated  with  fluctuations  in  the  downrange  direction  (i.e.,  the  new  terms  that  only  appear  when  the 
FW  transformation  is  used  to  generate  the  PE),  since  these  are  completely  new  to  acoustics.  Eventually, 
we  recognize  that  the  acoustic  effect  induced  by  these  terms  is  analogous  to  the  vacuum  polarization 
contribution  to  the  atomic  Lamb  shift. 

Specifically  in  the  constant-density  acoustic  problem,  a  discontinuity  in  the  sound  speed  or  in  its  first 
derivative  will  force  the  second  derivative  with  respect  to  the  variable(s)  of  transverse  space,  and  so  also 
the  lowest  order  FW  term  (involving  a  second  range  derivative  of  JU  =  \(\  —  ft2)  ),  to  pick  up  a  5- 
and/or  a  c> -function,  respectively1)  Types  of  the  sound  speed  profiles  that  will  lead  to  these  contact 
potentials  are  illustrated  in  Fig.  3.  After  stochastic  averaging,  a  c) -function  survives  along  the  average 
plane  of  a  cusp  of  the  sound  speed,  but  the  FW  term  associated  with  a  sound  speed  jump  will  average  to 
zero.  Furthermore,  it  turns  out  that  stochastically  averaging  the  sound  speed  jump  or  cusp  as  it  fluctuates 
in  transverse  space  will  lead  to  a  S'-  and/or  a  (^-function,  respectively,  along  the  average  plane.  The 
effects  associated  with  downrange  and  transverse  fluctuations  combine  to  form  the  acoustic  Lamb  shift. 


r  As  discussed  in  Section  3.3.3,  these  contact  potentials  modify  the  boundary  conditions  of  the  field  at  the  surface 
along  which  fj  or  its  derivative  jumps.  This  will  be  examined  in  much  more  detail  throughout  Section  4,  where  it  is 
shown  that  these  contact  potentials,  coupled  with  the  lowest-order  part  of  the  basic  range-independent  Hamiltonian, 
H  =  k0  +  A  ,  lead  to  new  boundary  conditions  on  the  field  and/or  its  first  derivative. 


Using  the  Foldy-Wouthuvsen  Transformation 


27 


Distributions  (8- 
functions)  are 
generated  at  cusps 
and  jumps  of'the 
sound-speed  profile 


Fig.  3  —  The  traditional  Lamb  shift  emerges  as  a  fluctuating  singularity,  such  as 
a  sound  speed  cusp  or  jump,  is  averaged.  The  new  terms  introduced  by  the  FW 
transformation  will  in  the  context  of  acoustics  generate  contact  potentials  at 
jumps  in  the  sound  speed  or  its  gradient.  These  contact  potentials  very  closely 
resemble  the  true  Lamb  shift  effects.  In  fact,  at  a  sound  speed  cusp  and 
apparently  also  at  a  density  jump  (but  not  a  sound  speed  jump),  the  new  FW 
term  survives  averaging  to  contribute  to  the  stochastic  problem,  and  so 
contribute  to  the  true  Lamb  shift.  The  contribution  from  the  FW  term  will  be 
called  a  downrange  Lamb  shift,  as  opposed  to  the  traditional  Lamb  shift  (also 
called  here  a  transverse  Lamb  shift).  The  meaning  of  this  terminology  will 
become  clearer  below. 


with  the  discontinuity  in  the  first  derivative  of  the  sound  speed  forming  the  direct  analogy  to  the  quantum 
Lamb  shift.  The  assertions  in  this  section  are  summarized  in  Fig.  4. 

3.2.2  A  Range-dependent  Environment  Generated  by  Stochastic  Fluctuations 

3.2.2.1  Parameterizing  a  Range-dependent  Environment  as  a  Perturbation  of  a 
Range-independent  Environment 

Now,  let  us  examine  the  specifics.  For  the  current  context,  assume  two-dimensional  space 
where  x  is  the  range  and  z  is  the  depth.  To  begin  with,  consider  a  sound  speed  profile  without 

discontinuities  in  either  the  sound  speed  or  its  gradient  (or  equivalently,  in  either  =  —  or 

its  gradient).  We  have  Ll(x,z)\  —  \,Ll\  (z) .  Now  distort  it  by  a  function  of  the  range: 

v  7 1  undistorted  L  J  v  7 

Listened  =>M(x,z)  =  [/l](Z-f(x))  (Fig.  5). 

More  generally,  we  should  consider  a  distortion  that  varies  with  the  depth:  /  (x,  z)  ■  This  problem  is 
similar,  but  there  is  one  extra  tricky  issue.  As  long  as  the  scalar  Hamiltonian  H  is  a  simple  function,  then 
we  can  fix  z  and  Taylor  series  expand  in  /  (x,z)  ■  However,  higher-order  Hamiltonians  contain  cross¬ 
terms,  proportional  to  d"/j/dz"  ,  and  these  will  also  pull  down  extra  terms  proportional  to  d"  f  /dz"  ■ 
This  can  add  considerable  complexity  to  our  calculation,  without  adding  anything  new  conceptually. 
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An  example  of  a  classical  Lamb  shift 


Atomic  Physics 


Acoustics 


j  wi 


world  line  of  nucleus 


z 


interface  where 

environmental  parameters  jump 


x 


Roughness  induced  by 
vacuum  fluctuations 
(2nd  quantization) 

Contact  potential  comes  from: 
Vl  (l/l/M)=  -4 ttS(R) 

Average  energy  eigenvalue 
shifts  (of  s-state) 


Rough  surface  induced  by  the 
complexities  of  a  macroscopic 
environment 

Cusp:  Vz2(z0(z))  =  S(z) 

Interface:  V2(0(z)J  =  S\z) 

Average  downrange  wavenumber 
shifts 


Fig.  4  —  The  new  terms  predicted  by  the  FW  transformation  are  related  to  time- 
(or  range-)  dependence.  In  atomic  physics,  vacuum  fluctuations  induce  a  time- 
dependence,  while  for  classical  fields  the  complex  environment  involves  a 
range-dependence.  The  FW  transformation  predicts  new  terms,  which  lead  to 
contact  potentials  either  at  the  source  of  field  acting  on  a  electron,  or  alternately 
when  the  sound  speed  or  its  gradient  is  discontinuous.  In  the  stochastic  problem, 
an  averaged  rough  surface  essentially  generates  similar  terms,  which  directly 
modify  the  coherent  field.  As  discussed  in  greater  detail  in  Section  4.4,  these 
terms  modify  eigenvalues  of  the  coherent  field  rather  than  coupling  its  modal 
components.  The  connection  between  the  new  terms  induced  by  the  FW 
transformation  and  the  stochastic  problem  suggests  that  the  FW  transformation 
effectively  induces  a  kind  of  averaging,  which  buffers  the  singularity. 


A(*.  z)Ldist0Ited  =  [A]  (z)  M(x,  z)  =  [a]  (z  ■ -  fix)) 

Fig.  5  —  A  range -dependent  sound  speed  profile  is  generated  by  distorting 
a  range-independent  sound  speed  profile  by  a  function  fix) 


Since  this  is  a  side  issue  in  the  current  context,  and  these  terms  are  generally  small  anyway,  we  will 
ignore  them  here.  What  is  of  particular  interest  to  the  current  investigation  is  the  lowest-order  of  the  new 

terms  introduced  by  the  FW  transformation:  —X/Sk„  —  jU/Sk ^  ,  and  for  this  term,  we  explicitly  evaluate 
the  contribution  to  this  term  from  volume  scattering  and  show  for  the  stochastic  problem  that  the  z- 
dependence  in  /  does  not  change  the  result  at  all  (see  Appendix  E). 
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3.1.2.2  Stochastically  Averaging  the  Perturbative  Range-Dependence 

Recall  the  Hamiltonian  (Eq.  (17)).  For  the  time  being,  let  us  include  only  powers  of  A  and  not 
downrange  derivatives  in  the  power  counting.  This  allows  us  to  examine  the  lowest-order  FW  term  in  a 
workable  and  familiar  environment  -  as  a  modification  to  the  familiar  first-order  PE.  We  now  have  a 
“quasi-first  order”  Hamiltonian: 


H(«)=C1+r--T=[H](z-/)+J 1  (20) 

8k0  8  k0 

where  [ H ] (z)  is  the  undistorted  range -independent  Hamiltonian  (i.e.,  H  (x,z)  —  \H](z  ^  f  (v))). 
Now,  average  the  equation 


and  expand 


H(x,z)  =  [H](z)  +  ^-  + 


8k„ 


dz 


2  dz 


+  0(f3). 


As  we  will  see  shortly,  the  terms  in  the  box  are  related  to  previously  known  types  of  scattering.  Break  the 
field  into  coherent  and  incoherent  &X  components: 


and  make  use  of  the  fact  that 


Xix^)  =  {x)ix^)  +  sX 

<*>.</>  =  o 

{fSx),(f2)*o' 


This  leads  to 


8  kn 


dz 


nnti] 

2  dz2 


<*}+•■•■ 


(21) 


The  first  term  on  the  right-hand  side  of  Eq.  (21)  is  the  right-hand  side  of  the  unperturbed  (range- 
independent)  problem.  The  second  term  is  the  averaged  “FW  term.”  We  evaluate  this  term  in  Section 
3.2. 2. 3. 

The  third  term,  (/  ■  8x)d  [//]/3z ,  is  the  well-known  diffuse-scattering  term.  This  non-Hermitian 
term  measures  the  losses  to  the  coherent  wave  as  some  of  it  is  scattered  into  the  incoherent  fields.  One 


s  This  use  here  of  the  term  “diffuse  scattering”  is  quite  broad.  It  subsumes  several  types  of  coherent-incoherent 
coupling,  including  Bragg  scattering,  mode  coupling  and,  as  is  noted  in  Section  4.4,  the  transition  probabilities  in 
time -dependent  perturbation  theory  in  quantum  mechanics.  When/encompasses  wavelength  scales,  the  contribution 
from  this  term  is  dominated  by  Bragg  scattering,  and  for  this  reason  diffuse  scattering  is  often  associated  with  this 
phenomenon.  Bragg  scattering  emerges  naturally  from  perturbation  theory  (both  in  the  context  of  the  PE  and  in  full- 
wave  theory).  For  this  effect,  the  inhomogeneities  in  the  volume  (or  similarly  the  rough  surface)  look  like  a 
collection  of  diffraction  gratings,  which  in  higher-order  perturbation  theory  become  modulated  by  diffraction  effects 
induced  by  scales  larger  than  the  Bragg  wavelength  (i.e.,  the  size  of  the  effective  diffraction  grating).  As  used  here, 
diffuse  scattering  also  includes  weak  mode-coupling  induced  by  large  scales  acting  alone  in  the  absence  of  Bragg- 
scale  roughness.  The  latter  effect  is  very  similar  to  the  coupling  between  atomic  levels  induced  by  slow  time 
dependence.  Vacuum  fluctuations  are  multiscale  and  can  in  principle  induce  a  kind  of  mode  coupling  between 
electron  states  in  the  hydrogen  atom  that  is  quite  similar  to  Bragg  scattering  (including  modulation  by  larger  scales). 
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can  use  the  WKB  approximation  and  the  assumption  that  this  is  a  Markov  process  to  show  the  incoherent 
wave  is  proportional  to  the  coherent  wave: 

8z  x  if  {z) 


and  so  incorporate  this  term  into  an  equation  of  motion  involving  the  coherent  field  alone  [64,  65] .  This 
class  of  scattering  events  is  relatively  well  understood*  (at  least  in  the  present  context  where  the  density 
never  changes),  and  the  related  term  is  from  time  to  time  dropped  from  our  discussion  below  with  the 
understanding  that  the  coherent  field  calculated  neglecting  this  term  (crudely  speaking)  has  to  be 
multiplied  by  an  exponentially  decaying  envelope1*. 

The  ellipsis  in  Eq.  (21)  subsumes  a  new  cross-term  between  the  incoherent  field  8%  and  the  new  term 

LI /8 k0  .  This  provides  a  modest  correction  to  the  Bragg  scattering  loss  term,  and  it  will  also  be  neglected. 

Also  subsumed  in  the  ellipsis  are  fourth-order  and  higher  terms,  and  these  are  also  dropped  from  the 
present  discussion. 


The  fourth  term  in  Eq.  (21), 


(f-)nn] 

2  dz2 


(z)> 


involves  a  kind  of  “smearing  out”  to  the  rough  interface  as  far  as  the  coherent  wave  is  concerned.  This  is 
the  term  that  produces  the  well-known  form  of  the  Lamb  shift  effect  when  the  second  derivative  is  a  8  - 
function  -  as  opposed  to  the  more  obscure  aspect  of  the  Lamb  shift  effect  that  emerges  from  the  term 

which  is  examined  next  in  Section  3.2. 3. 3. 


3.2.2.3  Stochastically  Averaging  the  New  Term  Introduced  by  the  Foldy-Wouthuysen 
Transformation 

Having  examined  the  incoherent  scattering  and  the  smearing  terms  in  Eq.  (21),  we  still  need  to 
evaluate  the  new  term  (ji/8k0}(%) .  Assuming  that  the  range  dependence  /  (v)  is  independent  of  the 
depth,  this  is  easy  to  do.  We  start  with 


ju(x,z)  =  \ju]{z~f{x)). 


The  new  term  is  proportional  to 


4.,=/ 


32M 


dz2 


-f 


Z=Zo -/ 


dz 


z=z»-/ 


(22) 


*  For  a  good  discussion  of  the  parabolic  equation  and  diffuse  scattering  induced  by  volume  inhomogeneities,  see  the 
classic  text  by  Ishimaru  [64].  A  discussion  similar  to  the  one  being  pursued  here,  but  for  rough  surfaces  in  the 
context  of  normal  mode  theory,  is  presented  in  a  classic  paper  by  Kuperman  [65], 

u  By  no  means  is  it  being  claimed  here  that  diffuse  scattering  is  small,  or  that  it  would  be  trivial  to  graft  it  to  a 
solution  later.  However,  the  goal  below  will  be  to  examine  the  newly  introduced  physics  in  isolation  so  that  its 
potential  impact  in  the  context  of  underwater  acoustics  can  be  determined.  This  is  not  unlike  atomic  physics,  where 
the  set  of  hydrogen  eigenstates  (including  the  Lamb  shift)  is  examined  independently  of  any  consideration  of 
transition  probabilities,  induced  for  example  by  the  same  vacuum  fluctuations  as  those  that  cause  the  Lamb  shift  and 
similar  in  nature  to  the  diffuse  scattering  found  in  acoustics. 
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Since  5[//]/3z  is  here  assumed  to  be  continuous,  we  can  Taylor  expand  it: 


dz 


z»-/ 


dz 


-f 


dz2 


+  0(/2) 


to  get 


Similarly, 


=  f 


dz2 


Z=J0 -/(*) 


Z[m] 


dz 


j  2 


dz2 


+  0(/3). 


Zo  J 


dz2 


«=-0  /(■') 

Substituting  this  in  Eq.  (24)  and  averaging  yields 


dz2 


+0  (/). 


«u=</j> 


dz2 


ffM] 


z=z0 


dz 


Af)W 


+  0(f). 


(23) 


(24) 


(25) 


By  construction,  \fj  =  0  (the  distortion  is  assumed  to  be  symmetric).  Furthermore,  writing 
(/  (x\ )  /  (jc2))  =  |  dk  S  (k  j  elklx'~X2}  (this  form  follows  from  the  assumption  that  the  statistics  of  the 
distortion  are  range-invariant),  and  taking  derivatives,  it  is  easy  to  show  that  ^/  ■  /  ^  =  —  ^/2^.lt  follows 
that  there  is  a  cancellation  to  force  (//)  =  0  .  When  we  allow  the  range  dependence  to  vary  with  the 

depth  ( /  =  /  (.r,  z) ),  then  the  calculation  is  a  bit  trickier,  but  once  again  we  find  that  as  long  as  the 

sound  speed  and  its  gradient  are  continuous,  the  average  (ju'j  is  zero.  The  details  are  provided  in 
Appendix  E. 


Finally,  note  that  if  d  [z/]/r)z  or  [A]  have  jumps  (i.e.,  there  is  a  singularity  in  transverse  space),  it  is 
still  possible  to  perform  a  Taylor  series  expansion  everywhere  except  at  the  discontinuity.  Rather 
obviously,  it  is  however  not  legitimate  to  use  a  Taylor  series  to  generate  8  -functions  that  were  not  in  the 
function  to  begin  with  (effectively,  the  Taylor  series  generates  the  principal  value  of  the  derivatives). 
Thus,  the  bona  fide  8  -functions  generated  by  LI  do  not  cancel  (recall  that  the  FW  transformation  is  not 
an  expansion  based  on  the  Mean  Value  Theorem,  but  rather  a  canonical  transformation  of  the  wave 
equation),  and  we  are  left  with  8  -functions  (or  S'  -functions)  at  the  discontinuity,  even  after  performing 
the  stochastic  averaging  procedure  outlined  above.  Eater  in  Sections  4. 1  to  4.4,  when  we  examine  the  case 
of  [//]  discontinuous,  we  take  a  much  more  detailed  look  at  what  happens  when  we  distort  the  world  line 

of  a  singularity  in  transverse  space  to  create  a  rough  surface.  For  the  time  being,  let  us  familiarize 
ourselves  with  the  concepts  involved  by  taking  a  rudimentary  look  at  the  milder  singularity  where  the 
derivative  d  [//]/5z  is  discontinuous  (i.e.,  a  cusp  in  the  sound  speed  profile).  Then  to  illustrate  this  result, 

and  in  the  process  further  justify  the  line  of  argument  that  led  to  it,  the  deep  connection  between  this 
phenomenon  and  the  quantum  Famb  shift  will  be  explored  by  examining  a  “toy  model”  of  the  atomic 
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Lamb  shift  that  essentially  extends  the  sound  speed  cusp  in  two-dimensional  depth-range  space  so  that  it 
can  be  embedded  into  four-dimensional  space-time. 

3.2.3  A  Cusp  in  the  Sound  Speed  Profile 

This  section  considers  the  cusp  in  the  sound  speed  profile.  The  boundary  conditions  are  evaluated  in 
Section  3.2.3. 1  and  then  in  Section  3. 2. 3.2  they  are  projected  down  to  the  line  z  —  0  in  preparation  for  the 
stochastic  averaging  performed  in  Section  3.2. 3. 3.  The  step  in  fl  is  averaged  in  Section  3.2. 3.4  and  the 
smearing  term  is  simplified  to  a  familiar  form  in  Section  3. 2. 3. 5.  Finally  in  Section  3. 2. 3. 6  the  boundary 
conditions  are  converted  to  contact  potentials.  This  calculation  forms  the  prototype  for  many  that  follow, 
and  so  it  is  considered  explicitly  here  rather  than  being  relegated  to  an  appendix  as  are  some  comparable 
calculations  later  in  this  study. 

3.2.3.1  The  Boundary  Conditions  Along  a  Cusp 

Figure  6  below  illustrates  a  profile  [//]  (z)  that  has  a  cusp  at  the  origin  z  —  0 . 


Fig.  6  —  The  basic  cusp  at  z  =  0 .  We  expand  in  a  Taylor  series  to  keep  track  of  the 
discontinuities  of  derivatives  of  the  sound  speed  function  [//] .  In  this  report  the  two  sides  of  an 
interface  are  labeled  II  and  I,  with  the  positive  z-axis  pointing  from  II  into  I. 
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We  use  a  Taylor  series  expansion  to  0  (k  j  to  break  [//]  into  a  part  that  will  induce  discontinuities 

and  a  5  -function  in  jil ,  and  a  part  that  contributes  to  the  continuous  part  of  Ll  (which  as  we  have  just 
seen  averages  to  zero).  Therefore,  we  begin  with 


[//](z)  =  [//](°)  +  z0(z) 


3[/h] 


dz 


+  zQ(-z) 


3[/b/] 


z2  ,  ^  32  [///l 

+T0<^ 


z=0 

2 


dz 


z= 0 


z=0 


■°(U 


z= 0 


where  0(z)  is  the  Heaviside  step  function 

0(z)  =  - 


1  z>  1 

0  z  =  0 


(26) 


(27) 


Note  that 


3k] 

3  [  A//  ] 

32k] 

and 

z=0 

32k/] 

dz 

o’  dz 

z= 0 

o’  3r 

z= 0 

dz2 

z=0 


are  now  treated  as  constants. 

Using  the  fact  that  zS  (z)  =  0 \z2 5 (z)  =  0  ,  we  have 


3k 


dz 


32k 


=  0(2) 


3k] 


+z0(z) 


dz  1,-0 

32k] 


+®H) 


3  \Mh  ] 


dz2 


=  S(z) 


+0(2) 


3z~ 

3k] 


dz 

+  z©(-z) 


z= 0 

32k/] 


z=0 


3  [A//  ] 


dz 

v 

32k] 


z-o 


9z 


dz~ 


+  0(-z) 


dz 


z=0  J 

32k /] 


z=0 


z=0 


z=0 


Now,  setting  z  =  Z0~  f  and  substituting  into  Eq.  (22),  we  have 


k  =f2s{z0-f) 


( 3k] 

3  \_Mii  ] 

\ 

l 

o  dz 

z=  0 

z=0/ 

+f 


-f 


Q(Zo-f) 


32k] 


dz 


+©K,+/) 


32k/] 


z=0 


dz 


z=0  J 


0(Zo"/) 


3k] 


3z 


+  ®(Zo-/) 


3  [/6/  ] 


J=0 


3z 


z=0  J 


+  a  part  that  is  continuous 


(28) 


(29) 
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Substituting  into  the  “quasi-first  order”  Hamiltonian  H  =k0  +  / 1  —  X/Sk^  given  by  Eq.  (20),  and 
dropping  the  “0”  in  z0  ,  the  basic  PE  —idx/dx  =  H%  (Eq.  (18))  gives  us 


Yli 

2k0 


+ 


S  kn 


d  \Mii  ] 

\ 

{  dz 

o  dz 

z= 0 

z=oJ 

8{z-f)x  + 


terms  that  are  continuous 
or  (at  worst) 

proportional  to  a  step  function 


(30) 


Here  we  find  a  8  -function  “potential”  in  the  PE  much  like  that  found  in  the  Kronig-Penney  model  known 
from  quantum  mechanics  (e.g.,  see  Ref.  66).  Let  us  follow  the  precedent  from  that  context  and  evaluate 

c  f+£  j 

this  contact  potential  by  taking  an  infinitesimal  integral  in  the  transverse  space  dz •••  (we  take  a 

Jf-e 

much  closer  look  and  generalize  the  result  in  Section  3.3.3).  The  terms  that  are  continuous  or  proportional 
to  the  step  function  potential  do  not  contribute  to  the  infinitesimal  integration  and  so  they  drop  out. 
Following  the  conventions  in  Fig.  7,  this  leaves  us  with  the  boundary  conditionsv 


dZ 

dz 


dX, 

+  f2 

d[jUn] 

,=/  dZ 

*-/  4 

dz  dz 

x(f) 


x„(f)=x,(f)=x(f) 


(31) 


Fig.  7  —  By  convention,  in  this  report,  the 
z-axis  always  points  from  side  II  into  side  I. 
Integration  is  also  from  side  II  to  side  I. 
Note  that  here  the  horizontal  axis  is  in  the 
downrange  direction,  while  in  Fig.  6,  the 
horizontal  axis  measures  the  sound  speed 
parameter  ju(z). 


3.2.3.2  Effective  Boundary  Conditions  Aong  the  Line  z  =  0 

Now,  convert  the  boundary  conditions  on  X  and  dx/dz  at  z  —  f  to  effective  boundary  conditions 
down  at  z  =  0  by  performing,  independently  on  each  of  the  two  sides  of  the  interface,  Taylor  series 
expansions  of  the  function  X  and  dx/dz ,  respectively.  Beginning  with  X  -  we  have 


z(o)=z(f-f)=z(f)-f^- 


f2  d2Z 


Z=f 


2  a  Z 


■°(/T 


2=/ 


v  Note  that  continuity  of  guarantees  that  it  and  <)% /tix  are  free  from  8  -functions.  This  in  turn  guarantees  that  the 
related  terms  in  Eq.  (30)  are  indeed  free  from  8  -functions  as  asserted. 
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and  exactly  the  same  for  %n  .  Subtracting  the  equations  for  X,  and  %n  gives  us 


ZI(0)-Zu(0)=-f 

=  f_ 
2 


dz 


d2Z z 


~dXn 

5 

f 

+  — 

fd2X, 

VXn 

5 

+  0  (f3) 

*=/  dz 

z=/ J 

2 

^  dz 

Z=f 

dz2 

*=/  J 

\J  ) 

(32) 


dz2 


d2Zn 


Z=f 


dz2 


+ 


o(/3) 


z=/y 


Here  we  have  used  the  fact  that  d% / dz  is  continuous  to  0(f 2  j .  Similarly,  we  see  that  %  is  continuous 
on  the  surface  Z  =  f  ,  so  the  tangential  derivative 

f.^ce^  +  Z^ 

3x  dz 

is  continuous.  Since  d% /dz.  is  continuous  to  0(f2\ ,  d% / dx  at  the  surface  is  continuous  to  O  [  /  5  j , 

and  from  the  PE  so  must  H  X  .  Evaluating  HIXI  ~  HnZn  =  0 ,  and  noting  that  X  and  U  are  continuous 
at  the  interface  Z  =  f  ,  we  have 


d2Z, 


dz2 


d2Zn 


z=f 


dz2 


=  0, 


z=f 


and  so  Eq.  (32)  gives  us 


^(0)-^(0)  =  0+o(/3). 

Similarly,  taking  a  Taylor  series  expansion  of  the  first  derivative  of  X  > 


(33) 


(34) 


dX, 

dX„ 

_dX, 

dXn 

-f 

z=f 

fd2X, 

d2Xn 

\ 

dz 

z=o  dz 

z=o  dz 

Z=f  dz 

f  dz2 

z=f 

Z=f  > 
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(d3X, 

d3Z„ 

\ 

l*s 

r  dz3 

z=f 

z=f  J 
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°(/3) 


(35) 


f2 


dz  dz 


r(o)  +  Y 


d3X , 


dz 3 


d3X„ 


z=  0 


dz3 


+ 


0(f3). 


z=0j 


3.2.3.3  Averaging  the  Effective  Boundary  Conditions 

Now  adapt  a  technique  originally  developed  by  Kuperman  for  the  slightly  different  context  of  rough 
surfaces  in  normal  mode  theory  [65].  In  Eqs.  (32)  and  (35),  once  again  break  the  wave  function  into 

coherent  and  incoherent  parts:  x\—0  =  (z)(®)  +  &X  -  and  average  these  boundary  conditions.  This  gives 
us 
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6r,>(oH*„>(o)=o+°  (/>) 


9  b/) 

9  (Zn) 

_<T) 

ro 

i _ 

dz 

z-o  9z 

4 

z=0 

dz  dz 

(z)(°) 


+  (f2) 

f93b/) 

9  '(Zn) 

\ 

2 

l 

,0  9z3 

z=0  J 

+o(/>) 


(36) 


Note  that  we  have  effectively  replaced  the  actual  boundary  condition  at  z  =  f  with  an  effective  boundary 
condition  at  z  =  0 .  Thus,  we  have  effectively  moved  the  8  -function  in  Eq.  (29)  down  to  the  z  =  0  line. 
As  described  in  Appendix  E,  to  ensemble  average  away  from  the  interface,  we  must  similarly  express  the 
(in  this  case  very  well-behaved)  environmental  parameter  ft  in  terms  of  a  Taylor  series  that  measures  its 
departure  from  its  unperturbed  values.  Obviously,  the  same  applies  to  the  part  of  ft  that  is  continuous  at 
the  interface. 

3.2.3.4  Averaging  the  Step  Functions  in  ft 


It  only  remains  to  do  the  same  for  the  step  functions  0(±(z  —  /))  found  in  Eq.  (29).  To  move  such  a 
step  function  back  to  the  z  =  0  line,  use  a  Taylor  series  to  project  the  coefficients  of  the  step 
0(±(z  —  /))  down  a  distance  — /  .  This  effectively  extends  the  half-spaces  down  to  the  z  =  0  line, 

with  the  Taylor  series  analytically  extending  the  coefficients  of  the  step  0(±(z  —  /))  as  necessary.  This 

changes  the  location  of  the  step  without  affecting  the  function  outside  the  interval  z  e  [(),  /] .  For  the  step 
functions  in  Eq.  (29),  this  gives  us 
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z=oy 
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(37) 


Note  that  like  the  continuous  part  of  ft ,  this  averages  to  zero: 


=  0. 
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3.2.3.5  Simplifying  the  Smearing  Term 

Now,  let  us  look  at  the  smearing  term  in  the  boundary  condition  for  given  by  the  second 

part  of  Eq.  (36).  This  term  involves  the  boundary  condition  for  the  third  derivative  33(/Jf)/3x3 .  To 
obtain  this,  consider  Eq.  (21), 

-^  =  [H]{z)(z)+o((fSZ),(f),(fSz),{f2)), 

and  take  the  transverse  derivative  Vr  =  d/dz  of  this  stochastic  Helmholtz  equation  on  either  side  of  the 
interface: 


—l 


(38) 


and  evaluate  the  second  equation  on  both  sides  of  the  interface  and  subtract.  Note  that  ,  Vr  and 

consequently  its  transverse  3Vy,  (%)/dx  only  violate  continuity  at  O  or  higher. 

Therefore  subtracting  Eq.  (38)  evaluated  just  inside  the  two  regions  gives  us 


0-V?te)l<;to'>lj-Mvdft1L-vdft,]L)6)+o(</»).(r)) 
v;  {X,  )L  -  v;  (X„  )|.  =  (Vr  \ft,  ]|  -vr  [//„  L  )(x)  +  o  «/*).</’)), 


(39) 


and  Eq.  (36)  becomes* 

(i,)(0)-(x„)(0)  =  ° +o(/s) 
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dz 
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+ko(f2) 

3k] 

3kJ 

0  V  / 
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n  3z 

z= 0 
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(40) 


<r>L+0(/!) 


w  Note  that  we  have  demonstrated  that  here  in  in  quasi -first  order  (i.e.,  0(2,2) )  theory  -  or  for  that  matter  in 
ordinary  first  order  0(2)  theory  -  jumps  in  the  third  derivative  d3  (x)/dz3  are  generated  by  jumps  in  the  gradient 
of  [//]  (i.e.,  d\tu\jdz  ).  Later,  when  we  relax  the  condition  that  [//]  be  continuous,  then  will  see  that  jumps  in  the 
sound  speed  [ //  J  similarly  lead  to  jumps  in  the  second  derivative  of  the  field  d2  (x)/c)z2  .  Ultimately,  these  jumps 
in  the  higher-order  derivatives  of  the  field  provide  the  underlying  physics  behind  the  contact  potentials  that  appear  in 
perturbation  theory.  Interestingly,  in  the  context  of  first  order  0(2)  and  quasi-first  order  (i.e.,  0(2,2) )  theories, 
these  contact  potentials  in  turn  spawn  effective  boundary  conditions  on  d(%)/dz  and  later  also  in  (%'j .  This  kind  of 
transference  of  discontinuity  from  higher-order  down  to  lower-order  derivatives  is  examined  further  in  Section  4.2.2. 
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Note  that  there  are  no  Bragg  scattering  terms  (i.e.,  proportional  to  8/ )  in  the  boundary  conditions  (Eq. 
(40)).  If  we  assume  Gaussian  statistics,  then  the  expectation  values  of  odd  powers  of  the  surface 
displacement  function  /  and  its  derivatives  are  zero,  and  consequently  the  error  terms  in  the  boundary 
conditions  (Eq.  (40))  are  in  fact  fourth-order. 

3.2.3.6  The  Effective  Boundary  Conditions  as  a  Contact  Potential 

In  the  quasi-first  order  (i.e.,  O^A,A ij)  theory  currently  being  used,  the  lead-order  derivative  in  the 
kinetic  term  is  V).  ^)/2 k0  .  This  allows  us  to  replace  boundary  conditions  (Eq.  (40))  by  adding  a  contact 
potential  to  the  unperturbed  sound  speed  function  [//]—>[//]  +  [ /^]conlacl ,  where 


[M] 


contact 


<y 

3[a„] 

y) 

[  8*o 

dz 

o  dz 

z= 0 

z=0_ 

2 

dz 

3[/c] 


dz 


-iV 


z= 0  J 


(41) 


The  (/’’j  contribution  to  this  8  -function  happens  to  be  the  same  as  if  we  added 


2  dz2 


to  /LI ,  so  we  can  interpret  this  term  as  representing  the  smearing  of  the  sound  speed  profile  in  the 
transverse  direction  by  a  root  mean  distance  and  average  displacement  (f)  =  0 .  The  part 

of  the  8  -function  came  from  the  new  term  contributed  by  FW  transformation,  and  it  can  be  written  as 


2 


r  .  y 

i 

v2k0  y 


As  discussed  in  Appendix  A,  this  implies  that  the  new  term  contributed  by  the  FW  transformation 
corresponds  to  the  smearing  of  the  potential  in  range-domain  by  virtual  fluctuation  of  size  i/2k0  .  The 
imaginary  factor  occurs  because  the  phenomenon  is  evanescent  in  the  downrange  direction. 

Returning  to  the  very  important  result  (Eq.  (41)),  note  that  we  could  have  obtained  the  part 
proportional  to  ^/'2  ^  immediately  by 

■  taking  Eq.  (22),  and 

■  treating  the  term  proportional  to  j  as  a  principal  value  plus  a  bona  fide  8  -function,  which  to 
this  order  can  simply  be  translated  down  to  the  origin, 

■  taking  the  term  that  is  merely  discontinuous,  but  free  of  8  -functions  (i.e.,  the  one  proportional  to 

/ ),  and  expanding  it  in  a  Taylor  series,  being  sure  to  keep  only  the  principal  values  of  any 
derivatives  generated  by  the  Taylor  series, 

■  and  finally  taking  an  ensemble  average  and  noting  that  ( f  ”  \  +  (/'/)  =  0  - 
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Also  note  that  we  could  have  immediately  obtained  the  transverse  smearing  term  proportional  to  lf°')  by 


/ 


adding  v  /2  and  this  time  accepting  any  8  -functions  generated  (this  particular  shortcut  will 

work  until  we  introduce  a  density  jump  in  Section  4).  For  our  next  example,  we  will  exploit  some  of  these 
shortcuts. 


3.2.4  A  “Toy  Model”  of  the  Atomic  Lamb  Shift 

3.2.4.1  Transferring  the  Results  for  a  Cusp  in  Two  Dimensions  to  a 
Central  “1/r”  Potential  in  Four-dimensional  Space-time 

Now,  let  us  illustrate  the  connection  between  the  cusp  in  a  sound  speed  profile  and  the  quantum  Lamb 
shift.  To  do  so,  we  will  consider  a  crude  “toy  model”  of  the  atomic  Lamb  shift  that  will  not  only 
reproduce  a  well  known  non-relativistic  model  of  the  atomic  Lamb  shift  but  also  add  some  very 
interesting  new  physics  to  it.  Begin  with  a  Klein-Gordon  equation  with  a  scalar  potential: 


a2 

dr 


c2V2  +  col(l  +  U  ( r,t )) 


</)  =  0. 


(42) 


Now,  compare  Eq.  (42)  to  the  Helmholtz  (Eq.  (2)): 


f  d2 

ydx2 


+  V2  +  &2n2 


(x,Rt) 


A  =  0 


(43) 


and  make  the  identifications 

col  <=>  kl  to.r  V2<^=>-c2V2  n2  o  l  +  U(r,t).  (44) 

Note  that  ti(O0  =  Rest  energy  =  m0c2  .  The  fourth  identification  in  Eq.  (44)  implies 


M  =  - 


I  -n 


/^effective 


1-(1 +u)_  u 

2  •  enective  ^  ^ 

Now,  the  expansion  parameter  for  the  FW  transformation  goes  as 


V2  -c2V2 

A  =  —  -k0ju^>  > lnew  = - 

2  kn  2  con 


'  ^O^effective 


2\t2 

c  V 
2  (Qn 


U 


-+C0o~ 


(45) 


and  the  corresponding  PE  (in  this  case,  a  Schrodinger  equation)  is 


±i^= 

dt 


8  con 


<P 


co0+- 


-c2V2 
2  con 


-  +  0)q  —  + 


^  i  /^effective 


8  con 


(p 


(46) 
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Now,  make  the  replacement  (f)  =  e+uo"'(p  to  remove  the  rest  energy  CO0 .  Also,  multiply  through  by  ti  and 
use  c2/a>0  -  fij m0  to  get 


dt 


-n2v2 

2  mn 


u  hfi, 


+  h  (Or. - h 

0  2 


effective 


8  con 


< P . 


and  this  time  choose  the  top  branch  to  match  conventions  in  quantum  mechanics.  Make  the  identification 


V  —  fld»o  2  ^ ^Ineffective 


to  get 


dt 


r-tfV2 

V  2mo 


+  V  — 


j_aV 

8^  9t2  y 


^  +  - 


(47) 


Now,  let  us  consider  the  standard  radial  potential  corresponding  to  an  electric  field  around  a  point  source 
(typically  the  nucleus  of  an  atom): 

V  (r)  =  — 
r 

(where  r  =  |r  | ).  Impose  second  quantization  (vacuum  fluctuations)  by  perturbing  the  spatial  coordinate11: 

F— >r  +  Jr(t)  sothat  V  (r)  (r  +  Sr  (f)) . 

Now  the  world  line  of  the  nucleus  is  a  rough  interface  (Fig.  8)y. 


Fig.  8  —  The  world  line  of  the  nucleus  is  a  rough  surface 


x  We  could  have  used  the  more  general  Sr  (r ,1) ,  and  applied  the  basic  derivation  given  in  Appendix  E  to  evaluate 
this  problem.  We  would  once  again  determine  that  adding  the  spatial  dependence  in  Sr  adds  a  great  deal  of 
complexity  to  the  calculation  without  adding  much  new  to  the  problem  or  changing  the  result. 

y  Incidentally,  it  is  primarily  the  very  light  electron  that  is  advected  by  the  fluctuations,  and  not  really  the  very  heavy 
nucleus.  However,  in  the  current  context  it  does  not  really  matter  which  is  moving,  since  only  the  relative  position  is 
relevant. 
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Taking  derivatives,  we  have 


dV 

dt 

d2V 

dt2 


Sr  ■  VV (T  +  Sr) 

ST  ■  VV(T  +  ST)  +  S7-VdV(r  +  Sr^ 

dt 

Sr  ■  WV[T  +  Sr)  +  Sr  ST  :  VVV  ( r  +  Sr) 


Now  expand  in  ST  to  0 ( Sr  )  using  the  shortcuts  we  just  developed  by  examining  the  cusp  (PV  = 
principal  value) 

2*7 

— -  =  ST  -WV  (T)  +  ST  ST  :  PV^VVP  (r  )J  +  ST  ST  :  VVV  (r), 
and  average  (noting  that  (ST\  =  0) 


1 d2V] 
ydt2  > 


=  (STST^j:  PV[VVP  (r)J  +  (ST ST') :  VVV  (F) . 


Sr  Sr)  =  (  vv),  where  ST  =  v  is  the  velocity  of  the  displacement.  Furthermore, 


Now  —{SrSry- 

assume  the  displacement  follows  Gaussian  statistics;  specifically,  that  the  motions  in  the  x,y,z 
directions  are  not  correlated  to  obtain 


(vv)  =  y  Sir 


(To  see  where  the  factor  of  3  came  from,  take  the  trace  of  both  sides.)  This  leads  to 


From  the  basic  identity 


-4 ttS(T)  , 


(48) 


(49) 


we  have  PV^VV(r)J=0  and  V2V  (r )  =  V()  [— ■ 47TC>(r)J  .  Thus,  substituting  Eq.  (49)  into  Eq.  (48)  and 
then  the  result  into  Eq.  (47),  we  find  that  the  new  term  introduced  by  the  FW  procedure  is 

1  -  1 


Via  the  uncertainty  principle,  we  have  that  the  lifetime  t0  of  the  virtual  particle  pair  (or  equivalently 
fluctuation)  is  given  by 
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At 


max 


—  Iq  — 


1 

2co0 


and  so  the  new  term  becomes 


vv. 


(51) 


To  within  a  sign,  this  is  just  the  smearing  out  of  the  Coulomb  potential  due  to  virtual  fluctuations  (i.e., 
particles)  in  the  time  domain  that  travel  an  average  distance  ^  ( Artime  fluctuation )  ~  ^  during  their  lifetime. 

The  negative  sign  occurs  because  time -domain  fluctuations  are  evanescent.  Since  the  frequency  2 CO0 

corresponds  to  the  rest  mass  of  the  “electron-positron”2  pair,  it  is  very  large.  There  is  no  reason  to  expect 
the  velocities  of  the  virtual  particles  to  be  relativistic,  and  so  we  would  not  a  priori  expect  Eq.  (50)  (or 
equivalently  Eq.  (51))  to  be  very  large  in  this  context.  The  precise  nature  of  the  fluctuations  is  determined 
by  quantum  field  theory,  which  is  outside  the  scope  of  this  report. 


Now,  let  us  add  the  smearing  term.  We  could  just  replace  ^  • 3  with  /{  ■  V  V  ,  and  use  the 

shortcut  describe  above.  However,  adapting  the  relevant  term  from  Eq.  (41)  directly  provides  an 
opportunity  to  further  explore  the  relationship  between  the  atomic  problem  and  the  classical  field.  In  the 
M  ’  radial  transverse  space  with  a  point  singularity,  the  infinitesimal  integral 


3  k,] 


dz 


z=0 


dz 


2=0 


3  2n 

dz2 


gets  replaced  as  follows: 


3k] 


3kJ 


dz 


z= 0 


dz 


lz=0 


Sphere 
of  radius  £ 
centered  at  0 


r/3rV2V  . 


(52) 


Now,  use  Stokes’  theorem  to  evaluate  this  infinitesimal  integral33: 


lim  (b  d  rV  V  =  lim 

£ — >0  J  £ — >0 

Sphere  Surface  of  Sphere 

of  radius  £  of  radius  £ 

centered  at  0  centered  at  0 


<J>  rfS(n-VV) 


=  <j)  dQ.  V0 


all 

directions 


” 

( 

£  M 

2  I 

r 

r 

—r  • 

~~2 

V  )\ 

=  ~4/rVn 


(53) 


z  The  term  is,  of  course,  used  very  loosely  here  since  these  objects  are  spinless  in  our  toy  model. 

aa  The  differential  element  of  solid  angle  clil  can  be  parameterized,  for  example,  by  d Q  =  sin  (O)dOd(p ,  where  8 

is  the  azimuth  angle  and  (p  the  polar  angle. 
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This  leaves  us  with  the  identification 


dz 


3k„] 


z= 0 


dz 


~4nV0, 


z=  0 


and  replacing  ^  with  (Sr’'^  j 3,  we  have  from  the  second  term  on  the  right-hand  side  of  Eq.  (41) 


SV, 


transverse  smear 


=  S(z) 


Sr 


[-4ttV0] 


Sr 


vv, 


(54) 


which  precisely  agrees  with  Welton’s  qualitative  description  of  the  Lamb  shift  [62]  (see  also  Ref.  57, 
page  59  for  a  clear  exposition  of  Welton’s  theory).  Note  that  overall,  we  have  generalized  Welton’s  model 
by  adding  time -domain  smearing  -  Eq.  (51)  -  to  his  smearing  in  the  transverse-domain  (three- 
dimensional  space)  -  Eq.  (54).  This  is  the  contribution  of  the  Foldy-Wouthuysen  transformation  in  this 
context. 

3.2.4.2  Insights  Concerning  the  Two  Contributions  to  the  Lamb  Shift 

Thus,  the  “toy  model”  used  here  nicely  confirms  the  identification  of  the  acoustic  phenomenon  given 
in  Eqs.  (40)  and  (41)  as  a  classical-field  Lamb  shift.  We  have  also  discovered  that  there  are  two  types  of 
Lamb  shift:  a  downrange  Lamb  shift  and  a  transverse  Lamb  shift.  Welton’s  model  of  the  Lamb  shift  only 
includes  the  latter  effect.  The  [time/range] -domain  smearing  (or  downrange  Lamb  shift)  is  new  here.  Note 
that  it  occurs  because  the  transverse  space  oscillations,  which  are  imposed  by  hand,  automatically  induce 
secondary  oscillations  in  the  time/range -domain,  and  that  our  field  theory  automatically  generates  this 
effect. 

An  interesting  distinction  between  the  downrange  Lamb  shift  and  the  traditional  Lamb  shift  emerges 
when  we  consider  the  volume  scattering  problem.  As  noted  in  the  discussion  following  Eq.  (25),  for 
volume  scattering  (i.e.,  when  the  potential  does  not  contain  singularities),  the  downrange  Lamb  shift 
invariably  disappearsbb.  (Equation  (25)  addresses  the  acoustic  problem,  but  it  is  trivial  to  transfer  the 
result  over  to  the  “toy”  atomic  problem.)  This  is  not  necessarily  true  of  the  traditional  or  transverse  Lamb 

shift.  For  the  radial  potential  V0/r  in  R3,  V2V  is  invariably  0,  except  at  the  origin,  so  there  is  no 

smearing  of  the  potential  away  from  the  origin.  Thus,  in  the  atomic  problem,  there  is  also  no  volume- 
induced  transverse  Lamb  shift  (at  least  at  lowest  order).  On  the  other  hand,  for  an  arbitrary  sound  speed 

function  [//] ,  it  is  perfectly  possible  for  the  term  d2ju/dz  to  be  nonzero,  and  so  we  can  see  from  Eq. 

(21)  that  we  can  have  a  nonzero  transverse  smearing  term,  even  in  the  absence  of  any  singularities.  The 
transverse  (or  traditional)  Lamb  shift  comes  from  this  term,  and  so  it  follows  that  fluctuations  in  a  volume 
can  in  general  generate  a  transverse  (or  traditional)  Lamb  shift  effect  in  the  acoustic  problem. 

Whether  the  downrange  Lamb  shift  or  the  transverse  Lamb  shift  dominates  depends  on  whether  the 
interface  is  slowly  oscillating  while  eventually  achieving  a  fairly  large  amplitude  (on  the  scale  of  a 
wavelength),  or  rapidly  oscillating  with  a  more  modest  amplitude.  If  the  fluctuations  fall  into  the  second 
category,  then  the  downrange  Lamb  shift  will  dominate.  It  is  in  principle  possible  that  in  some  classical 
environments,  fluctuations  at  the  Bragg  wavelength  may  fall  into  the  second  category.  However,  a  typical 
scenario  in  underwater  acoustics  is  assessed  in  Section  5.4,  and  there  the  transverse  Lamb  shift  still  turns 
out  to  be  the  dominant  effect. 


bb  However,  recall  that  the  new  FW  term  also  induces  a  small  correction  to  diffuse  scattering,  which  was  relegated  to 
the  ellipsis  in  Eq.  (21).  This  effect  need  not  disappear  in  the  volume  scattering  problem. 
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If  the  fluctuations  fall  into  the  first  category,  then  we  would  expect  the  transverse  Lamb  shift  to 
dominate.  As  alluded  to  above,  this  is  the  case  for  the  atomic  Lamb  shift.  The  reason  is  that  imposed 
fluctuations  Sr  are  caused  by  vacuum  fluctuations  of  the  electromagnetic  field  (i.e.,  virtual  photons),  and 
not  by  virtual  electron-positron  pairsy.  The  lifetime  of  a  massless  virtual  photon  can  be  much  longer  than 
t0 ,  the  lifetime  determined  by  the  uncertainty  principle  for  a  virtual  electron-positron  pair  with  rest  mass 

2 m0.  Thus,  Sr  ~  fphotonv  can  be  much  larger  than  Artime  fluctuation  =  t0v  .  (Note  that  the  electron-positron 

pair  is  stationary  aside  from  the  velocity  of  the  field  fluctuations  on  which  it  (along  with  the  nucleus)  is 
advectedy.)  Indeed,  Welton’s  model  of  the  Lamb  shift  falls  a  little  short  of  the  correct  answer,  and  the 
time -domain  Lamb  shift  has  the  wrong  sign  to  narrow  the  difference00.  Therefore,  the  time-domain  Lamb 
shift  must  be  pretty  small. 

3.2.4.3  Limitations  of  the  “Toy”  Model  for  the  Hydrogen  Atom 

The  failure  of  our  semiclassical  “toy  model”  to  fully  account  for  atomic  Lamb  shift  should  come  as 
absolutely  no  suiprise.  There  are  several  layers  of  approximation  between  this  model  and  a  realistic 
hydrogen  atom.  A  realistic  hydrogen  atom  is  described  by  Quantum  Electrodynamics  (QED).  There  is  a 
great  deal  going  on  in  QED  that  lies  well  beyond  the  scope  of  any  model  that  approximates  second 
quantization  as  the  simple  advection  of  space  by  vacuum  fluctuations^.  Our  simple  model  for  second 
hydrogen  atom  in  the  absence  of  second  quantization  would  be  the  Dirac  equation,  which  describes  a  field 
with  Vi-integer  spin.  Rather  than  performing  the  FW  transformation  directly  on  the  Dirac  equation,  we 
first  replaced  the  Dirac  equation  with  the  Klein-Gordon  equation  describing  a  scalar  (spinless)  field00.  As 
described  in  Chapter  9  (pp.  202-203)  of  Ref.  57,  such  a  field  would  describe  a  pionic  atom.  Replacing  the 
hydrogen  atom  with  a  pionic  atom  therefore  constitutes  our  second  approximation.  Now,  it  is  important  to 
note  that  Eq.  (42)  does  not  even  provide  a  realistic  model  of  a  pionic  atom.  A  realistic  pionic  atom  would 
have  a  diagonal  matrix  potential  imposed  on  the  state  space  equation,  while  Eq.  (42)  involves  a  scalar 
potential  directly  in  the  original  Klein-Gordon  equation.  Replacing  the  diagonal  matrix  potential  with  a 
bona  fide  scalar  potential  constitutes  our  third  major  approximation. 

The  problem  loses  some  of  its  richness  with  each  of  the  three  approximations  above.  Quantum 
Electrodynamics  is  far  beyond  the  scope  of  the  discussion  here,  and  we  will  not  be  able  to  go  further  into 
this  fascinating  topic.  The  Dirac  equation  is  somewhat  richer  than  the  Klein-Gordon  equation.  In  a 
realistic  hydrogen  atom,  the  effects  of  spin  are  fairly  significant00,  but  also  relevant  to  the  discussion  here 
is  the  fact  that  the  Dirac  equation  contains  first-order  derivatives  with  respect  to  the  spatial  coordinates 
rather  than  second-order  derivatives.  This  leads  to  a  richer  FW  transformation  with  more  terms,  and  so 
more  physics.  The  significance  of  replacing  the  Dirac  equation  with  the  Klein-Gordon  equation  (what  was 
called  the  second  approximation  in  the  paragraph  above)  is  discussed  a  little  further  in  Appendix  F.  1 .  The 
third  major  approximation  mentioned  in  the  paragraph  above  involves  replacing  the  diagonal  matrix 
potential  of  a  pionic  atom  with  a  bona  fide  scalar  potential  that  looks  more  like  a  sound  speed  function  // 
from  the  acoustic  problem.  As  discussed  in  Appendix  F.2,  this  leads  to  further  loss  in  the  rich  structure  of 
the  problem.  Since  the  so-called  Darwin  term  associated  with  the  atomic  problem  provides  a  window  into 


00  In  fact,  this  term  corresponds  to  vacuum  polarization,  which  should  indeed  be  modest  in  size  compared  with  the 
primary  contribution  to  the  Lamb  shift,  and  it  should  also  carry  the  opposite  sign.  See  Section  4.4  for  a  little  further 
discussion  on  this  topic. 

dd  For  example,  quantum  field  theories  such  as  QED  address  the  symmetry  properties  of  the  particle  field.  Thus, 
electrons  are  fermions  and  their  wave  functions  must  be  made  antisymmetric,  while  solutions  to  the  Klein-Gordon 
equation  such  as  pions  are  bosons  and  their  wave  functions  should  be  symmetrized.  Such  considerations  are  omitted 
from  our  semiclassical  approach. 

ee  This  is  the  most  significant  omission  in  our  “toy  model.”  A  properly  modified  time-dependent  Dirac  equation 
would  predict  an  anomalous  magnetic  moment,  and  this  provides  most  of  the  missing  shift.  This  will  be  discussed  a 
little  further  in  Section  4.4. 


Using  the  Foldv-Wouthuvsen  Transformation 


45 


the  diminishing  richness  of  the  problem,  and  it  is  in  some  contexts  similar  enough  to  the  Lamb  shift  that  it 
may  cause  some  confusion,  it  is  briefly  examined  in  Appendix  F.3. 

3.2.5  The  Key  Insights  to  Emerge  from  these  Examples 

To  conclude  this  section,  note  that  there  are  two  significant  insights  that  we  should  take  out  of  our 
examination  of  the  classical  Lamb  shift.  They  both  center  on  the  discovery  of  the  downrange  Lamb  shift 
(as  noted  in  footnote  cc,  this  is  the  classical  equivalent  of  the  “vacuum  polarization”  correction  to  the 
Lamb  shift).  This  is  a  new  physical  effect  that  has  not  previously  been  included  by  classical  field  PEs,  and 
in  the  future  it  may  need  to  be  incorporated  to  improve  the  accuracy  of  PE  models.  Perhaps  even  more 
significantly,  we  have  discovered  something  profound  about  the  PEs  generated  by  the  FW  transformation. 
Let  us  take  a  moment  to  reflect  on  what  we  have  just  seen.  The  time  (or  downrange)  fluctuations  were 
introduced  by  hand.  When  the  resultant  [time/range] -dependent  potential  (or  equivalently  sound  speed 
function  /LI )  are  [time/range] -averaged,  these  will  induce  a  contact  potential  correction  to  the  (now 
[time/range] -independent)  average  problem.  This  much  constitutes  Welton’s  model  of  the  atomic  Lamb 
shift.  However,  the  externally  imposed  [time/range]  dependence  also  induces  secondary  virtual 
fluctuations,  and  these  automatically  show  up  in  the  form  of  new  terms  in  the  FW  transformation.  This 
forms  the  new  type  of  Lamb  shift  effect:  the  downrange  Lamb  shift.  Very  significantly,  note  that  the  FW 
transformation  has  automatically  dressed  the  singularity,  even  before  there  has  been  any  explicit  time 
averaging.  This  will  be  very  significant  later,  when  we  will  count  on  the  FW  transformation  to  similarly 
dress  the  singularity  associated  with  a  density  jump. 

3.3  The  Downrange  Stepping  Procedure  Near  an  Interface 

In  quantum  mechanics,  the  usual  concern  is  determination  of  observables’  expectation  values.  In  a 
typical  calculation,  these  are  obtained  using  quantum  mechanical  perturbation  theory.  Needed  in  this  case 
are  a  list  of  various  (in  some  sense  small)  perturbing  potentials  to  be  added  to  the  basic  problem,  and  the 
eigenstates  for  the  basic  problem.  In  classical  field  theory,  on  the  other  hand,  we  are  primarily  interested 
in  using  the  PE  to  simulate  downrange  propagation  of  the  field,  often  for  a  specific  deterministic  problem. 
In  problems  involving  propagation  through  large-scale  features,  this  usually  means  propagating  through 
an  environment  that  has  been  mapped  out  by  independent  measurements.  When  the  problem  also  involves 
a  stochastic  distribution  of  small  (Bragg)  scale  features,  the  known  large-scale  deterministic  environment 
may  be  augmented  with  a  synthetic  realization  generated,  for  example,  in  the  context  of  a  Monte  Carlo 
calculation.  As  discussed  briefly  in  the  introduction,  it  is  possible  to  model  a  field  propagating  through 
such  a  complex  environment  by  discretizing  the  problem  onto  a  grid  and  using  the  PE  to  step  downrange. 
When  range-dependent  interfaces  (or  a  cusps)  are  present,  this  will  entail  stepping  across  tilted 
boundaries.  This  is  a  nontrivial  operation,  and  we  need  to  devise  a  robust  procedure  for  doing  so. 
Therefore,  this  section  closely  examines  the  mechanics  of  stepping  across  a  tilted  interface. 

As  noted  in  Section  2.1,  the  explicit  formal  development  conducted  in  this  report  is  restricted  to  a 
consideration  of  quasiplanar  one-dimensional  interfaces  of  the  basic  form  z  =  f  {x)  embedded  in  a  two- 
dimensional  space  where  x  is  the  range  and  z  is  the  depth  (recall  that  the  positive  z  -axis  points  upward 
by  convention).  In  this  case,  the  (now)  scalar  transverse  gradient  Vr  will  be  used  interchangeably  with 
d/dz  .  The  key  ideas  needed  to  generalize  the  procedure  applicable  to  the  restricted  two-dimensional 
problem  to  the  full  three-dimensional  problem  are  briefly  outlined  in  Appendix  G. 

By  our  convention,  in  this  report  quantities  above  the  interface  (typically  the  water  column)  will  be 
associated  with  a  Roman  numeral  7,  and  those  below  the  interface  (typically  the  sub-bottom)  will  be 
associated  with  a  Roman  numeral  II.  Note  that  the  positive  z  -axis  then  points  from  region  II  into  region  I. 
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3.3.1  Discretizing  the  PE 

Consider  some  field  %  that  obeys  a  generic  PE  —id%/dx  =  H % .  The  Hamiltonian  H  is  a  function 
of  the  transverse  derivative  Vr .  The  leading  order  term  is  proportional  to  .  The  equation  is 

discretized  to  first  order  in  the  step  size  Ax  using  the  Mean  Value  Theorem.  The  basic  stepping 
algorithm  isff: 

Jnew  =  ZcM  +  i(H old  Jold  )  •  A*  +  o((ACf )  .  (55) 

To  implement  this  algorithm,  set  a  certain  default  (maximum)  downrange  step  size  Axmax  and  fix  the 

vertical  grid  spacing  A z  ■  Note  that  the  slope  is  small  and  the  transverse  derivatives  are  of  higher  order 
than  the  downrange  derivative.  Both  these  observations  independently  suggest  that  the  vertical  grid 
spacing  should  be  somewhat  finer  than  the  default  step  distance88  (i.e.,  Az  <  Axmax ).  Next,  adjust  the 

downrange  step  size  Ax  so  that  the  surface  falls  on  a  grid  point  at  each  and  every  step.  (See  Fig.  9;  for 
the  purposes  of  illustration,  vertical  distances  have  been  exaggerated  in  the  figure.)  The  default 
(downrange)  step  increment  Av'inax  only  gets  used  when  the  surface  is  flat  (i.e.,  range-independent).  Since 

the  slope  of  the  interface  is  always  small  (we  will  see  that  it  must  be  <  45°),  the  vertical  grid  spacing 
always  remains  finer  than  the  horizontal  step  size,  even  when  the  adjusted  downrange  step  size  is  smaller 
than  the  default  step  size  (i.e.,  Az<  Ax  <  Av'inax ).  If  the  surface  does  not  reach  the  next  higher  vertical 
grid  point  during  a  maximum  x  -increment,  the  surface  is  effectively  flat.  The  minimal  resolvable  slope  is 
therefore  determined  by  the  size  of  the  (fixed)  vertical  grid  spacing  A z  relative  to  the  default  (maximum) 
downrange  step  size  Av'max . 


Fig.  9  —  The  grid  to  be  used  in  the  stepping  algorithm  is  set  up  according  to  the 
following  rules: 

i.  Choose  a  maximum  downrange  step  size  as  the  default  step  size. 

ii.  Choose  vertical  grid  spacing  (less  than  the  default  downrange  step  size). 

iii.  Adjust  downrange  stepping  so  the  interface  falls  on  a  grid  point. 

iv.  Minimal  resolvable  slope  is  determined  by  the  size  of  the  vertical  grid 
spacing  relative  to  the  default  downrange  step  size. 


ff  As  discussed  in  Section  2.1,  in  practice  this  basic  algorithm  is  modified  using  the  Crank-Nicholson  procedure,  and 
sometimes  also  the  Pade  approximation.  These  modifications  are  not  used  in  the  formal  development  pursued  here. 
It  is  straightforward  to  implement  the  Crank-Nicholson  procedure  on  the  results  developed  here,  but  the 
development  of  the  appropriate  Pade  approximations  necessitates  future  research. 

gs  Note  that,  in  practice,  one  typically  takes  out  an  exp(;T0x)  from  the  wave  function,  so  away  from  the  interface, 
the  wave  function’s  rate  of  change  associated  with  x-dependence  and  that  associated  with  “-dependence  are 
comparable.  Therefore,  a  higher-order  derivative  in  the  (vertical)  ^-direction  indeed  demands  a  finer  step  size  in  that 
direction. 
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Appendix  Section  G.  1  discusses  the  major  issues  related  to  generalizing  the  stepping  algorithm  derived 
here  (i.e.,  for  a  one -dimensional  interface  embedded  in  two-dimensional  space)  to  the  problem  of  a  two- 
dimensional  surface  embedded  in  three-dimensional  space.  Assuming  the  third  dimension  is  given  by  the 
unit  vector  y ,  different  y  =  constant  lines  of  grid  points  would  have  to  be  stepped  separately  so  that 
they  always  hit  the  interface.  Since,  as  discussed  in  Section  3.3.2. 1,  evaluation  of  H /  in  a  discretized 
space  inherently  involves  a  neighborhood  of  nearby  points  in  the  transverse  grid,  it  is  impossible  to 
completely  isolate  y  =  constant  lines,  and  some  fancy  formal  work  must  be  used  to  address  this  issue.  As 
we  continue  with  the  development  of  the  basic  problem  begun  here  in  Section  3.3.1,  it  is  important  to 
once  again  remind  ourselves  that  along  with  the  current  report,  virtually  all  other  current  implementations 
of  the  PE  are  also  restricted  to  one-dimensional  interfaces  embedded  in  two-dimensional  space  (i.e., 

Z  =  f  (x)  and  Vr  =  d/dz ). 

The  variable-step  algorithm  described  here  is  useful  for  theoretical  development.  In  a  practical 
numerical  implementation,  it  may  prove  useful  to  fix  the  step  size.  This  simplifies  the  numerical 
procedure,  particularly  in  the  three -dimensional  case  or  if  there  are  multiple  interfaces  present.  The 
alternative  stepping  algorithm  with  fixed  step  size  is  briefly  examined  at  the  end  of  Section  3. 3.2.4. 

3.3.2  Evaluating  the  Hamiltonian  at  a  Penetrable  Interface 

3.3.2.1  Implementing  the  Stepping  Procedure  at  a  Boundary 

As  we  begin  our  examination  of  the  stepping  procedure,  assume  that  the  wavefunction  %  is 
continuous  at  the  interface,  but  make  no  assumption  about  the  continuity  of  the  transverse  derivative 
dx/dz  at  the  interface.  The  somewhat  artificial,  but  nonetheless  useful,  case  of  %  discontinuous  at  the 
interface  is  considered  separately  in  Section  3.3.4.  On  the  other  hand,  there  is  little  point  in  separately 
considering  the  case  / dz  continuous.  As  is  explained  in  Section  3. 3.2. 3  (just  after  Eq. 
(57)),  very  little  simplification  and  no  new  insights  emerge  from  the  imposition  of  this  condition. 

The  stepping  procedure  is  iterative.  Assume  that  the  field  X  is  given  at  a  certain  value  of  the  range, 
and  that  downrange  stepping  advances  to  the  right.  Along  the  old  (i.e.,  already  given)  slice  of  grid  points, 
specifically  assume  that  where  the  interface  crosses  the  grid,  the  field  is  known  on  the  right  side  (i.e.,  the 
downrange  side)  of  the  interface.  The  stepping  algorithm  is  now  used  to  step  infinitesimally  close  to,  but 
just  to  the  left  of  the  next  point  where  the  grid  intersects  the  interface.  The  field  X  is  continuous  at  the 
interface  and  so  the  value  of  the  field  X  is  now  known  at  all  the  new  grid  points  to  the  right  of  the 
interface.  The  next  step  is  to  operate  on  X  with  the  discretized  form  of  H  in  order  to  evaluate  H X  (the 
generator  of  downrange  stepping)  just  to  the  right  of  the  interface.  Once  this  quantity  is  obtained,  a  new 
downrange  step  follows  immediately.  The  challenge  lies  in  the  mechanics  of  evaluating  H%  in  the 

vicinity  of  the  interface.  Specifically,  recalling  that  the  leading  order  term  in  H X  involves  V'^X  ,  we 
need  to  take  derivatives  up  to  and  including  this  order.  It  turns  out  that  in  order  to  do  so,  it  is  necessary  to 
evaluate  all  the  transverse  derivatives  V^,  with  /  up  to  but  not  necessarily  including  this  order  on  both 
sides  of  the  in  terface.  The  m'h  order  derivative  will  then  be  obtained  on  one  side  of  the  interface  and  H X 
is  evaluated  directly  on  that  side.  If  that  side  happens  to  be  to  the  right  of  the  interface,  then  we  are  ready 
for  the  next  step.  If  the  side  where  H X  is  known  happens  to  be  to  the  left  of  the  interface,  an  implicit 
boundary  condition  will  be  used  to  obtain  H X  on  the  right  side. 
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3.3.2.2  Using  Boundary  Conditions  to  Evaluate  Derivatives  at  an  Interface 

We  now  examine  the  procedure  for  using  boundary  conditions  to  take  derivatives  (up  to  the  needed 
order)  of  the  discretized  field  %  in  the  vicinity  of  the  interface.  The  procedure  is  iterative.  The  lowest 
order  is  calculated  first,  and  then  this  information  is  used  to  obtain  the  second  derivative,  and  so  forth1’11. 
Thus,  we  begin  by  considering  the  first  derivative.  The  discretized  transverse  derivative  of  %  is  given  by 
the  difference  between  the  values  of  %  evaluated  at  two  adjacent  vertical  grid  points  divided  by  the 

separation  distance  Az  between  them.  Whether  this  discretized  derivative  is  assigned  to  the  top  point  or 
the  bottom  point  is  a  matter  of  convention.  For  the  first  derivative,  choose  to  identify  it  with  the  top  point. 
Now  there  is  an  orphaned  grid  point:  the  point  just  above  the  interface.  No  finite  difference  can  be 
assigned  to  it.  This  is  where  the  boundary  condition  on  the  first  derivative  comes  in.  It  is  used  to  deduce 
the  value  of  the  grid  point  just  above  the  interface  from  the  value  of  the  derivative  just  below  the 
interface.  All  odd  number  derivatives  will  turn  out  to  work  the  same  way.  (See  Fig.  10a.)  For  the  even 
derivative,  things  are  just  a  little  different. 

The  reason  the  second  derivative  and  all  even-order  derivatives  are  different  is  that  the  derivatives 
should  be  kept  roughly  centered  on  the  point  to  which  they  are  assigned.  For  example,  the  second 

derivative  of  Y  at  the  point  labeled  j  + 1  should  be  given  by  dr  zi^Zd  ~ 

L  -h=Zy+l 

[xJ+1  ~  2^” 1  +  XJ )  j (Az)  -  This  is  the  value  typically  used  in  numerical  implementations.  To  obtain 

this  value,  recall  that  the  second  derivative  is  the  derivative  of  the  first  derivative,  and  postulate  that  the 
discretized  derivatives  of  the  first  derivative  must  be  assigned  to  grid  points  using  the  opposite  convention 
from  the  one  used  to  obtain  the  first  derivative.  Since  the  first  derivative  was  assigned  to  the  upper  grid 
point,  the  derivative  of  the  first  derivative  is  assigned  to  the  lower  grid  point.  These  conventions  give  us 
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and  the  desired  result  above  follows  immediately.  Note  that  when  we  take  the  derivative  of  the  derivative, 
the  orphaned  grid  point  lies  below  the  interface,  and  now  this  value  must  be  obtained  from  the  result  just 
above  the  interface  using  the  boundary  conditions.  Since  the  procedure  is  iterative,  it  continues  as 
described  for  the  first  and  second  derivatives,  with  the  convention  assigning  derivatives  to  grid  points 
alternating  as  described  above11.  (See  Fig.  10.) 


hh  In  most  cases,  boundary  conditions  for  derivatives  of  given  order  are  independent  of  any  higher-order  derivatives, 
and  the  implementation  is  completely  straightforward;  if  not,  the  iterative  procedure  outlined  below  at  least  produces 
the  appropriate  number  of  equations  and  unknowns,  and  the  problem  remains  well-defined  and  soluble. 

11  Note  that  as  we  continue  to  go  up  to  higher  orders  n,  the  numerical  evaluation  of  V^z  in  the  discrete  problem  will 
require  the  participation  of  an  increasing  number  of  vertical  grid  points. 
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a)  For  n  odd: 


b)  For  n  even: 


Fig.  10  —  The  conventions  for  discretizing  the  derivative  near  an  interface 
is  shown  for  a)  n  odd  and  b)  n  even.  When  n  is  odd,  the  convention  assigns  finite 
difference  approximations  of  the  derivative  to  the  top  point.  When  n  is  even,  the 
convention  flips  so  that  higher-order  derivatives  remain  centered:  for  example 
for  n  =  2,  the  second  derivative  is  assigned  to  the  bottom  point  and  we  end  up 
with 


(vTjf2-(vTjy+1  r 2/"  -i  r 

[tX)  "  Az  “  (Az)2  ’ 

At  the  interface,  there  is  always  one  orphaned  value  that  must  be  evaluated  using 
the  boundary  conditions. 


3.3.2.3  The  PE  Spawns  a  New  Family  of  Boundary  Conditions 

There  is  a  subtle,  but  very  important  distinction  between  the  PE  and  the  Helmholtz  equation.  The  latter 
is  second  order  in  the  transverse  derivative,  so  the  solution  is  fully  specified  by  two  boundary  conditions 
at  a  penetrable  interface.  However,  a  PE  that  is  m"'  order  in  the  transverse  derivative  is  well  posed  only 
once  m  boundary  conditions  have  been  specified  on  the  penetrable  interface  [67,  68],  This  formal  issue 
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of  well-posedness  in  the  continuum  limit  is  directly  tied  to  the  practical  problem  of  how  to  calculate  the 
higher-order  derivative  needed  to  numerically  solve  the  discrete  problem. 

In  practice,  researchers  have  usually  sidestepped  this  issue  by  truncating  the  entire  PE  (including  the 
stepping  algorithm)  at  the  order  m  =  2  near  the  boundary  (i.e.,  at  first  order  in  the  PE  expansion 
parameter  X ),  and  in  this  way  avoided  the  issue  of  determining  the  extra  boundary  conditions^.  In  fact,  as 
outlined  in  Ref.  69  (Appendix  B),  the  technique  developed  above  for  evaluating  derivatives  near  the 
interface  has  not  been  used  in  the  standard  implementation  of  the  stair  step  PEkk.  The  alternate  approach 
described  in  the  reference  does  not  lend  itself  to  the  use  of  the  higher-order  boundary  conditions,  and  it  is 
not  appropriate  for  use  with  bona  fide  tilted  surfaces  (it  only  works  for  a  step-like  interface).  In  this 
report,  however,  a  method  for  determining  the  higher-order  boundary  conditions  is  developed  in  Section 
3.3.3.  For  the  moment,  we  are  exploiting  the  fact  that  these  m  boundary  conditions  must  exist  and  must 
have  been  specified  by  the  correct  formulation  of  the  problem. 

The  complete  formulation  of  the  problem  involves  m  explicit  boundary  conditions,  generally 
providing  boundary  conditions  on  %  to  V™  '  X  ■  This  allows  our  procedure  for  calculating  the  derivatives 
to  work  up  to  Vj^X  ■  Although  this  formally  specifies  the  problem,  this  all  by  itself  is  not  yet  enough  to 
evaluate  the  quantity  H X  ,  which  must  be  evaluated  in  order  to  proceed  with  our  stepping  algorithm.  The 
reason  is  that  this  quantity  contains  a  term  proportional  to  V”X  ■  Our  technique  for  taking  derivatives 
near  the  interface  will  allow  us  to  determine  V™X  on  one  side  of  the  interface  (for  the  conventions  in  Fig. 
10,  above  the  interface  if  m  is  even  and  below  if  m  is  odd),  but  we  now  do  not  have  the  value  of  Vj'X 
on  the  other  side.  Recall  that  this  quantity  would  be  needed  to  directly  evaluate  H X  on  that  side  of  the 
interface.  Since  the  Hamlitonians  H  for  the  PE  problems  that  interest  us  always  have  leading  order  m 
that  is  even,  we  can  evaluate  H X  above  the  surface.  If  the  slope  is  negative,  we  have  all  we  need  to 
proceed  to  the  next  step  of  the  stepping  procedure.  On  the  other  hand,  if  the  slope  is  positive,  we  need  to 
find  an  implicit  boundary  condition  that  allows  us  to  deduce  the  value  of  H X  below  the  interface.  (Note 

that  we  can  circumvent  the  need  to  explicitly  evaluate  the  missing  partial  derivative  V'^X  below  the 
interface  if  we  have  some  way  to  directly  determine  H X  on  that  side  of  the  interface.) 


The  implicit  boundary  condition  comes  from  the  equation  of  motion  tying  H  X  to  the  downrange 
derivative  combined  with  the  (a  priori  specified)  boundary  conditions  on  X  and  ^x/dz  ■  The  latter  are 
used  to  deduce  the  boundary  condition  on  dx/dx,  which  is  then  used  in  conjunction  with  the  PE 
equation  to  deduce  the  boundary  condition  on  H X  ■  For  most  PE  applications,  it  can  be  assumed  that  X 


11  This  is  a  significant  omission.  Over  the  last  20  years,  much  of  the  research  related  to  the  PE  has  focused  on 
obtaining  ever-higher  orders.  For  example,  one  of  the  key  advantages  of  the  Pade  approximation  is  that  it  allows  the 
efficient  calculation  of  the  high  order  PE.  Progress  in  this  direction  has  allowed  the  PE  method  to  be  applied  to  steep 
grazing  angles,  and  to  cases  where  the  environmental  parameters  significantly  vary  as  a  function  of  the  depth.  The 
latter  is  of  particular  relevance  to  the  issue  of  a  density  jump  at  the  ocean  bottom.  Here,  the  density  may  jump  by  as 
much  as  a  factor  of  two,  usually  the  largest  variation  of  the  environmental  parameters  to  be  found  in  the  entire 
problem.  Furthermore,  high  grazing-angle  results  are  suspect  when  such  a  crucial  part  of  the  problem  is  restricted  to 
first  order. 

kk  Instead,  one  places  the  interface  in  between  two  grid  points.  Then,  one  overlaps  the  upper  and  lower  half-spaces 
by  extending  each  out  beyond  the  interface  by  one  extra  grid  point.  Lowest  order  theory  (i.e.,  the  one  with  a  lead 
second  order  transverse  derivative  in  the  Hamiltonian)  is  used,  so  there  are  only  two  boundary  conditions  involved. 
The  two  boundary  conditions  are  used  to  solve  for  the  field  at  the  extra  (nonphysical)  points.  The  extra  points 
provide  enough  information  to  calculate  the  second  transverse  derivative  in  the  first  order  Hamiltonian,  which  is  the 
one  used  to  generate  downrange  stepping  right  at  the  surface. 
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is  continuous;  however,  density  jumps  will  lead  to  discontinuities  in  dx/dz  ■  The  continuity  of  %  implies 
that  the  difference  between  the  values  of  the  field  %  evaluated  on  the  two  sides  of  the  interface  is  zero 
(i.e.,  Xu  ~  X\—  =  Oon  the  surface),  which  implies  the  tangential  derivative  of  the  difference  is  also 

zero:  t  ■  V  (Ax)  =  0 .  Writing  out  the  tangential  derivative  in  terms  of  dx/dx,  dx/dz,  and  the  local 
slope  /  leads  to  an  equation  for  dx / dx  : 


d(Ax)  |  ,d(Ax)_0_  dx, 

dx  '  dz  dx 


Z  dXn 
dx  '  dz 


(56) 


and  from  the  PE  dx/dx  =  H X /— I  we  have  for  H  X  ■ 


H uXu  —  H\X\  if 


dz 


dZ 

dz 


(57) 


The  slope  /  used  is  that  of  the  upcoming  step.  The  quantities  on  the  right-hand  side  of  Eq.  (57)  are 
known  and  used  to  calculate  H uxu  (recalling  that  side  II  is  below  the  interface,  and  so  HuZu  cannot 
be  determined  directly  under  the  conventions  used  in  this  report). 

3.3.2.4  Some  Final  Thoughts  Concerning  the  Hamiltonian  at  an  Interface 

Now  we  can  see  how  little  would  have  been  gained  had  we  separately  considered  the  case  dx/dz 
continuous.  Equation  (57)  would  reduce  to  the  simple  condition  that  H X  be  continuous  at  the  interface, 
but  this  is  the  only  gain  and  it  represents  a  very  minor  simplification.  The  derivative  can  still  jump 

dramatically  at  the  boundary,  since  the  continuity  of  H X  depends  on  delicate  internal  cancellations.  For 
example,  consider  the  very  simple  Hamiltonian  H  =  k0  +  A .  Infinitesimal  integration  of  the  equation 
across  the  interface  of  the  type  discussed  in  Section  3.2.3  shows  that  X  and  d% / dz  are  both  continuous. 
Then,  from  Eqs.  (56)  and  (57),  dx/dx  and  H X,  respectively,  are  continuous  as  well.  Continuity  of  H X 
implies  that  H tx,  —  H nXu  =  0  and  so  we  have  shown  that  VjX  has  a  jump  proportional  to  kl/UX  ■ 

Note  that  just  as  for  more  complicated  situations  where  we  do  not  have  continuity  of  dx/dz  ,  we  still  will 
need  to  use  our  full  set  of  boundary  conditions  to  step  across  the  interface. 

Although  the  given  set  of  boundary  conditions  that  fully  specifies  our  problem  may  or  may  not 
explicitly  contain  the  slope,  the  algorithm  nevertheless  remains  sensitive  to  the  local  slope.  For  instance, 
slope  dependence  is  built  into  Eq.  (57),  the  implicit  boundary  condition  for  H X  ■  A  very  small  slope  that 
is  below  some  minimum  threshold  will  generate  a  flat  surface  where  successive  grid  points  on  the 
interface  remain  at  the  same  height.  As  illustrated  in  Fig.  11,  such  a  flat  interface  may  be  punctuated  by 
intermittent  “boosts”  where  the  slope  takes  on  its  minimum  value  for  a  single  step.  This  minimum 

allowed  value  for  the  slope  is  fixed  by  the  grid  size:  /fixed  =  fmm  =  Av'inax /Az . 
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When  the  actual  slope  is  below  the 
minimum  threshold  for  the  discretization  or 
when  the  downrange  step  size  is  fixed 


/  =  0  for  most  steps 


Fig.  11  —  For  a  discretization  to  resolve  smaller  and  smaller  slopes,  a  vertical  grid  that  is  finer 
and  finer  relative  to  the  default  (downrange)  step  size  is  needed.  If  the  vertical  grid  is  too  coarse 
relative  to  the  step  size,  the  interface  remains  at  the  same  height  during  successive  steps,  and  only 
jumps  occasionally.  Now,  the  discretization  can  no  longer  resolve  the  local  slope.  The  presence  of 
a  small  locally  invisible  slope  can  still  cause  a  cumulative  effect  whereby  after  many  downrange 
steps,  a  jump  in  the  interface  is  eventually  needed.  At  this  jump,  the  slope  is  artificially  high  for 
one  step.  This  is  an  artifact  that  can  be  made  to  disappear  by  using  a  finer  vertical  grid  or  a  larger 
maximum  step  size,  but  this  might  not  always  be  a  viable  option.  At  any  rate,  the  occasional 
appearance  of  a  “boost”  that  gives  the  slope  a  nonzero  value  for  a  single  step  is  only  a  mild  source 
of  error  since  the  slope  here  is  just  the  minimum  slope  that  our  discretization  would  be  capable  of 
resolving.  There  are  still  no  corners  and  steps  on  the  surface,  just  mild  steps  in  the  slope  associated 
with  discretization.  Note  that  since  we  always  use  the  slope  for  the  upcoming  step,  our  procedure 
always  gives  H  %  continuous  along  a  horizontal  portion  of  the  interface.  Also  note  that  for  the 
purposes  of  illustration,  the  slopes  have  once  again  been  exaggerated.  Finally,  note  that  the  figure 
also  applies  when  the  downrange  step  size  is  fixed.  Now,  the  fixed  slope  is  the  maximum  slope 
resolvable  by  the  algorithm. 


Fig.  11  also  suggests  an  alternate  to  the  variable-step-size  algorithm  presented  in  Section  3.3.1. 
Particularly  in  the  three-dimensional  case  or  if  there  are  multiple  interfaces  present,  it  is  useful  to  simplify 
the  numerical  implementation  by  dispensing  with  variable  sizes  Ax  altogether,  and  fixing  the  downrange 
step:  Ax  =  Axfixed  .  This  would  be  similar  to  current  implementations,  but  the  (distorted)  interface  would 

lie  right  on  top  of  the  grid  (rather  than  in  the  middle  between  grid  points),  and  (more  significantly)  rather 
than  having  stair  steps,  the  (distorted)  interface  would  remain  flat  for  a  series  steps  before  acquiring  some 

fixed  finite  slope  for  a  step  (as  in  Fig.  11):  /fixed  =  A.rfixed / Az .  This  fixed  slope  is  now  the  maximum 

slope  compatible  with  the  given  implementation  (i.e.,  when  the  interface  is  at  maximum  slope,  it  would 
move  vertically  with  every  downrange  step).  In  this  case,  the  relative  size  of  vertical  and  horizontal  grid 
spacing  would  be  determined  by  the  desired  maximum  slope  and  not  by  the  considerations  mentioned  in 

the  second  paragraph  of  Section  3.3.1  (i.e.,  now  /fixed  =  /max ).  In  this  case,  note  that  the  generalization  of 

the  stepping  algorithm  to  three  dimensions  is  now  comparatively  trivial  (i.e.,  the  need  for  interpolation 
described  in  Appendix  G.l  has  been  eliminated).  Also  note  that  the  presence  of  more  than  one  surface 
poses  no  problems  for  this  alternate  stepping  algorithm.  Finally,  in  a  practical  implementation  of  the  new 
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theory,  one  would  expect  streamlined  efficient  programs  to  work  best.  The  stepping  convention  outlined 
in  this  paragraph  avoids  the  complex  code  needed  to  vary  the  step  size  (along  with  its  associated 
numerical  overhead),  and  so  it  may  well  be  the  preferred  choice  for  typical  practical  implementations. 
However,  the  variable-step-size  algorithm  described  above  provides  a  very  nice  context  for  the  formal 
discussion  that  follows,  and  we  implicitly  assume  that  it  is  the  one  being  used  throughout  the  rest  of  this 
manuscript. 

3.3.3  The  Boundary  Conditions 

To  facilitate  our  look  at  the  manner  in  which  boundary  conditions  arise  from  the  formalism,  let  us 
begin  by  considering  the  standard  two-dimensional  problem  with  the  range  x  and  the  transverse 
coordinate  the  depth  z  .  The  generalization  to  the  three-dimensional  problem  with  a  two-dimensional 
transverse  coordinate  RT  =  (  y,  z)  will  be  briefly  considered  at  the  end  of  Section  3. 3. 3. 2  and  then  further 
developed  in  Appendix  G.2. 

3.3.3.1  The  Relationship  Between  Contact  Potentials  and  Boundary  Conditions 


Boundary  conditions  such  that  V\z  —  1 


for  now)  are  discontinuous  come  from  “distributions” 


in  the  Hamiltonian  H  .  In  this  context,  a  distribution  is  a  generalized  function  that  constitutes  a  “spike”  or 
a  “contact”  potential  (such  as  the  8  -function).  For  example,  as  we  saw  in  Section  3.2.3,  if  IT  contains  a 
8  -function  at  the  interface,  then  as  in  the  Kronig-Penny  model  in  solid  state  physics  (see  Ref.  66)  or 
equivalently  as  for  the  cusp  in  our  discussion  of  the  classical  Lamb  shift,  we  obtain  a  boundary  condition 
by  integrating  the  PE  along  the  transverse  dimension  across  an  infinitesimal  interval  bracketing  the 


interface 


for  the  cusp  in  the  two-dimensional  problem;  there  is  a  very  similar  integral  in 


the  spatial  dimension  of  the  Kronig-Penny  model  for  a  one -dimensional  lattice).  With  the  leading 
derivative  of  second-order  (i.e.,  a  lead  order  term  oc  V\% ),  this  procedure  will  generate  a  discontinuity  in 
the  first  derivative  of  the  wave  function. 


When  higher-order  derivatives  of  the  8  -function  are  present,  this  example  can  be  generalized  by 
adapting  a  procedure  originally  derived  by  Heaviside  [70,  71].  Before  applying  Heaviside’s  procedure, 
however,  the  rules  for  converting  8  -functions  times  functions  of  the  transverse  coordinate  variable  z  to 
8  -functions  times  something  evaluated  at  the  fixed  point  (on  the  surface)  z  =  f  (x)  need  to  be 

employed  (specifically,  use  the  equation  taken  from  the  classic  text  by  Lighthill  [72]  and  adapted  in  Eq. 
(59)).  This  leaves  us  with  a  sum  of  distributions  times  a  “constant,”  where  the  “constant”  is  the  field  and 

its  derivatives  evaluated  at  the  fixed  height  of  the  interface:  z{f)  ar*d  ^rZ\  ■  Now  following 

Heaviside,  the  key  step  once  again  consists  of  an  integration  across  the  interface  over  an  infinitesimal 

rf+e  , 

interval  in  the  transverse  direction  (i.e.,  lim  cfe •  •  •  for  the  two-dimensional  problem).  This  time,  the 

£— >0  J  f-e 

definite  integration  over  the  infinitesimal  interval  may  in  general  come  after  one  or  more  indefinite 
integrations  -  or  completely  equivalently,  integration  over  the  interval  from  — < °°  to  the  new  variable  zk  , 
where  it  is  understood  that  the  field  is  “turned  off’  at  — < °°  . 


To  write  all  this  symbolically,  take  the  basic  PE  H Z  +  idz/dx  =  0  and  perform  /7  +  1  “indefinite” 
integrations  followed  by  one  definite  integration.  This  gives  us  (for  the  two-dimensional  problem): 
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\ZdzLdz'  •  -\Zdz^  (^L  +\fZdz\Zdz'  •  "Dte 


d.x 


=  0 , 


(58) 


Z/i+i 


where  it  is  understood  that  we  must  always  make  the  substitution 


P=0pl(k-p)l  dzp 


=  8{k)(z-f)z(f)  +  --- 


(59) 


Z=f 


The  current  assumption  that  the  wave  function  %  is  continuous  implies  that  the  first  downrange 

'f+£  '$X/ 


derivative  /d.x  does  not  contain  a  distribution  and  so  J  dz  0  (or  more  generally 

|  rfej  dz}  ■  j*  dz.n+l  j  ~ ^  and  only  infinitesimal  integration  over  H  %  can  give  a  nonzero 

contribution  to  Eq.  (58).  Thus,  we  can  confine  ourselves  to  integrating  over  H  %  to  obtain  the  boundary 
conditions: 

•  /+f 


(60) 


Equation  (60)  is  of  course  subject  to  the  understanding  given  by  Eq.  (59).  Note  that  when  n  =  0 ,  we  have 

C*  l>.(«r)L  =0. 


and  we  additionally  also  have  the  equation 

\ffyZ{Hx)\=o.  (6i) 

The  indefinite  integrations  pull  off  Vr  ’s  from  the  lead  order  derivative  in  H  .  After  n  indefinite 
integrations,  the  leading  order  term  (in  our  notation,  it  is  <=c  )  is  left  with  m  —  n  transverse 

derivatives  operating  on  the  wave  function  V”  n  % .  The  infinitesimal  definite  integration  peels  off  one 
more  transverse  derivative  and  then  gives  the  difference  between  the  value  of  this  function  evaluated  on 
the  two  sides  of  the  interface:  V”~n~  ^  —V'/  "  lZn  ■  Nonzero  values  for  this  difference  come  from  the 
spike  potentials.  A  8  -function  provides  a  nonzero  contribution  after  just  one  (infinitesimal)  integration 
and  in  this  way  generates  a  jump  in  Vj"  —  V'/  'Zn  ■  Since  each  indefinite  integration  also  pulls  off  an 
order  from  the  derivative  in  the  spike  potential,  a  S'  -function  becomes  a  8  -function  after  one 
(indefinite)  integration,  and  it  therefore  provides  the  jump  after  a  total  of  two  integrations.  More 
generally,  8^  -functions  will  generate  jumps  after  a  total  of  n  +  1  integrations™11.  Note  that  if  no 
distribution  proportional  to  8^  is  present,  then  performing  a  total  of  n  + 1  integrations  simply  yields  a 
continuity  condition. 


11  This  equation  can  be  derived  using  the  following  iterative  argument:  Take  the  derivative  of  the  formula  for 
(z  —  f)z(z) ,  and  expand  using  the  product  rule  for  differentiation.  For  example,  treating/like  a  constant  and 
taking  z(/-)  to  be  some  general  test  function,  take  d/dz  of  the  equation  S(z  —  f)%(z)  =  S(z  —  f)z(f)  and 
rearrange  to  get  S'  (z  —  /)  %(z)  =  S'(z  —  f)  z{f)~  8(z-  f)  X  if)  ■  The  procedure  can  then  be  repeated. 
m  Note  the  integrals  of  these  distributions  are  zero  unless  the  number  of  integrations  is  just  right:  too  few 
integrations,  and  the  function  integrates  to  zero  under  the  definite  (infinitesimal)  intergration  (i.e.,  the  positive  and 
negative  areas  under  the  curve  cancel);  too  many,  and  the  infinitesimal  integration  kills  the  term  off  (i.e.,  the  S  - 
functions  have  all  integrated  out,  leaving  behind  an  integral  of  a  bounded  function  over  an  infinitesimal  interval). 
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3.3.3.2  Related  Technical  Issues  that  are  Addressed  Elsewhere  in  this  Study 

The  needed  distributions  in  H%  are  generated  both  by  higher-order  cross-terms  as  we  expand  the 

standard  square  root  operator  (yjl  +  2  A /k0  when  the  density  is  the  same  everywhere)  -  e.g.,  see 

Appendix  K.1.1  -  as  well  as  by  the  new  FW  correction  terms  -  as  we  have  seen  in  Section  3.2  and  will 
again  see  in  Section  4.1  and  Appendix  M.  Note  that  by  deriving  the  boundary  conditions  from 
distributions  embedded  within  a  Hermitian  Hamiltonian,  we  guarantee  energy  conservation  (and  so 
stability  as  well). 

If  the  needed  “constant”  coefficient  of  the  distribution  jumps  (e.g.,  we  have  8(z  —  f)  •  (/) 

where  we  have  already  determined  that  (/')  is  not  continuous  at  the  interface),  use  “  8  -function 

bifurcation.”  This  subtle  issue  is  explored  in  Section  5.2  when  an  understanding  of  it  becomes 
indispensable  for  considering  an  interface  with  a  density  jump  at  even  the  most  rudimentary  level.  8  - 
function  bifurcation  will  follow  rules  determined  by  examining  a  series  of  test  cases  for  which  the 
solution  is  known.  For  convenience,  the  rules  are  summarized  in  the  footnote  below™,  but  the  topic  is 
revisited  in  Section  5.2  and  again  in  more  detail  in  Appendix  K. 

Appendix  G.2  presents  a  brief  examination  of  the  generalization  to  the  full  three -dimensional  problem 
of  the  procedure  developed  here  in  Section  3.3.3.  Again,  the  y  -axis  provides  the  extra  dimension.  The 
integrations  used  to  determine  the  boundary  conditions  are  now  in  the  direction  normal  to  the  cut  of 
interface  in  the  x  =  constant  plane.  Call  this  two-dimensional  normal  n2D .  Call  the  two-dimensional 

tangent  t2D  .  The  boundary  conditions  are  expressed  in  terms  of  n2D ,  t2D  and  Vr  ,  which  in  turn  can  be 

expressed  in  terms  of  3/ /dy  and  yd/d y  +  zd/dz  .  The  downrange  slope  /  is  as  before.  (Keep  in  mind 
that  the  explicit  boundary  conditions  obtained  in  Sections  4.1,  5.3.2,  and  6.1  (specifically  Eqs.  (71),  (99), 
and  (120),  respectively)  will  reflect  only  the  basic  formalism  for  the  two-dimensional  problem  as 
developed  here  in  Section  3.3.3.) 

3.3.3.3  Relationship  Between  the  Boundary  Conditions  in  the  PE  Problem  and 
those  for  the  Full- wave  Problem 

As  noted  in  Section  3.3.2  (and  in  Refs.  67  and  68),  the  boundary  conditions  that  emerge  from  the 
procedure  described  here  more  or  less  substitute  for  a  smaller  set  of  boundary  conditions  associated  with 
the  full-wave  problem.  The  manifest  reason  for  the  appearance  of  all  these  new  boundary  conditions  is 
that  as  soon  as  powers  of  A  appear  in  the  Hamiltonian,  then  the  leading  order  derivative  is  some  number 
m  >  2  ,  and  a  full  set  of  m  boundary  conditions  must  be  explicitly  specified  in  order  that  the  PE  problem 
be  well  posed.  As  the  order  of  the  PE  approaches  °° ,  the  number  of  required  boundary  conditions  also 
goes  to  oo ,  even  though  the  full  Helmholtz  equation  it  approximates  is  only  second  order,  and  thus 
requires  only  two  boundary  conditions. 


nn  The  basic  rule  is  that  the  first  5  -function  obtained  splits  in  half  (i.e.,  bifurcates).  The  two  halves  are  displaced  in 
opposite  directions  away  from  the  interface.  Any  other  distributions  in  the  product  collapse  since  they  are  smooth 
functions  in  the  half-spaces  away  from  interface.  The  procedure  is  associative  in  the  sense  that  it  does  not  matter 
which  5  -function  is  chosen  to  be  the  first  one.  On  the  way  to  generating  that  first  5  -function,  the  chain  rule  for 
differentiation  applies.  The  Vr  operating  on  the  5  -function  will  generate  higher-order  derivatives  of  the  5  - 
function.  It  is  permitted  to  multiply  through  by  a  density  since  it  involves  an  undistorted  step,  but  in  general  it  is  not 
permitted  to  multiply  through  by  distorted  steps  produced  by  taking  functions  of  steps.  These  rules,  and  the 
empirical  evidence  justifying  them,  are  examined  in  Section  5.2  and  in  further  detail  in  Appendix  K. 
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There  is  a  rather  interesting  complementary  interpretation  for  these  new  boundary  conditions.  Along 
the  tilted  interface,  these  “new”  boundary  conditions  also  serve  as  replacements  for  the  downrange 
derivative  of  the  field  in  what  would,  in  the  full-wave  problem,  be  the  boundary  condition  on  the  normal 
derivative  of  the  field.  Specifically,  when  the  normal  has  a  downrange  component,  the  normal  derivative 

d%/dn  =  h  ■  V/  picks  up  a  contribution  from  the  downrange  derivative,  but  in  the  PE,  we  now  find  that 
the  explicit  boundary  conditions  involve  only  transverse  derivatives  of  the  field.  In  other  words,  only  the 
transverse  portion  of  the  normal  derivative  has  survived  to  play  a  role  in  specifying  the  PE  problem.  This 
turns  out  to  be  a  very  desirable  result,  since  the  downrange  derivative  plays  a  special  role  in  the  PE:  it  is  a 
downrange  stepping  operator,  and  as  such  it  cannot  appear  in  the  Hamiltonian.  Including  the  stepping 
operator  id/dx  into  our  boundary  conditions  would  amount  to  inserting  this  operator  into  the 
Hamiltonian,  in  which  case  the  equation  would  cease  to  be  parabolic!  (The  implicit  boundary  condition 
(Eq.  (56))  and  its  alternate  form  (Eq.  (57))  are  simply  consequences  of  the  continuity  condition  on  % . 
Note  that  they  are  derived  conditions  on  the  Hamiltonian,  but  they  do  not  add  new  information  to  the 
Hamiltonian  beyond  that  already  contained  within  the  continuity  condition  on  % ,  and  so  they  play  no 
role  in  the  formal  specification  of  the  problem.  Thus,  they  are  fundamentally  different  from  the  explicit 
boundary  conditions  discussed  here  in  Section  3.3.3.  In  particular,  they  do  not  contradict  the  assertions  in 
this  paragraph.  ) 

Before  taking  a  look  at  examples,  which  illustrate  many  of  the  ideas  discussed  here  in  Section  3.3.3,  let 
us  examine  what  happens  if  we  relax  the  continuity  condition  on  % . 

3.3.4  Relaxing  the  Continuity  Condition  on  the  Wave  Function  % 

We  will  shortly  find  it  useful  to  consider  a  problem  such  that  %  is  discontinuous.  This  result  is 
associated  with  the  relatively  artificial  problem,  where  we  consider  “mixed  orders.”  These  are  obtained  by 
matching  high  order  FW  correction  terms  with  low  orders  of  the  expansion  of  the  yj i  +  2 ~A/k0  operator. 

For  example,  once  again  as  in  Section  3.2.2  include  only  A  ’s  but  not  downrange  derivatives  in  our  power 
counting  to  get  the  quasi-first  order  Hamiltonian 


H  =  k0  +  A — (62) 
8&o 

A  ^-discontinuity  boundary  condition  emerges  from  a  Hermitian  Hamiltonian  (such  as  Eq.  (62)) 
containing  distributions  as  follows.  Integrate  down  a  vertical  line  as  before.  Given  that  the  leading  order 
of  the  PE  is  ,  a  S'm  11  in  the  “potential”  will  produce  a  jump  in  %. 

Now,  given  the  needed  boundary  conditions,  propagate  to  the  interface  as  described  above  in  Section 
3.3.1,  and  use  the  boundary  conditions  on  the  field  %  and  its  transverse  derivatives  to  move  to  the  far 
side  and  obtain  the  new  value  for  H %  needed  to  perform  the  next  step. 

For  our  current  purposes,  we  ignore  the  8  -function-like  singularity  generated  by  d%/dx  when  the 
interface  of  ^-discontinuity  is  range -dependent  (i.e.,  tilted).  There  are  two  justifications  for  this.  In  the 
examples  that  will  interest  us,  the  missing  term  is  of  higher  order  than  the  Hamiltonian  (for  constant 
density  theory,  the  missing  terms  are  O  (/3 )  or  O  [ f  ■  /'  ■  Vr  j )  and  it  vanishes  completely  in  the  purely 
coherent  part  of  the  stochastic  case.  More  significantly,  %  will  jump  at  an  interface  in  only  a  handful  of 
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relatively  artificial  cases.  Using  a  full  higher-order  theory,  where  for  example  —  X/Sk^  is  considered 
third  order  and  matched  with  the  0(/l3)  leading  term,  then  the  cT -function  in  —  X/Sk^  will  not  lead  to 
a  discontinuity  in  %  (much  more  is  given  in  Section  4.2  about  changes  in  the  meaning  of  a  given  contact 
potential  as  the  lead  order  of  the  PE  changes).  The  continuity  of  %  is  also  used  extensively  in  Appendix 
G.2  to  extend  the  procedure  for  extracting  boundary  conditions  to  the  full  three-dimensional  problem. 
Thus,  it  will  ultimately  prove  fortuitous  that  all  consistent  and  fully  realistic  problems  will  have  % 
continuous.  Nevertheless,  the  Hamiltonian  (Eq.  (62))  will  prove  useful  in  isolating  the  new  physics 
associated  with  the  downrange  Lamb  shift  generated  by  an  interface  such  as  the  ocean  bottom,  and 
assessing  its  possible  significance  in  underwater  acoustics,  so  we  will  press  on  with  the  analysis. 

Finally,  let  us  generalize  the  implicit  boundary  condition.  Assume  X\  =  a% n  at  the  surface.  Now,  we 
have 

ax n  -/i=0on  the  surface 

=>  tangential  derivative  of  axn  ~  X\  is  zero 


he.,  t  ■  V (ax„  Xi)~  0 
9  (  Q-X n  —  +  f  ^  =  0 


Thus,  we  have  all  we  need  to  apply  the  technique  outlined  in  Sections  3.3.1  to  3.3.3  for  the  instance  where 
X  is  discontinuous.  This  will  prove  useful  in  Section  4,  as  we  move  first  to  an  interface  where  the  sound 
speed  jumps  and  then  also  consider  the  case  where  the  density  is  allowed  to  jump  as  well. 

4.  THE  SOUND  SPEED  JUMP— TILT-INDUCED  SMEARING  AND  THE 

CLASSICAL  LAMB  SHIFT 

In  this  section  we  consider  a  sound  speed  to  jump  and  closely  examine  two  examples  where  the 
surface  roughness  buffers  the  singularity  (i.e.,  smears  it  out): 

•  uprange/downrange  smearing  induced  by  a  deterministic  tilt  and  associated  with  the  new  term 
introduced  by  the  FW  transformation 

•  the  classical  Lamb  shift  induced  by  a  stochastic  rough  interface,  and  associated  primarily  with 
smearing  in  the  transverse  direction. 

In  Sections  5.3.2.  and  5.3.4,  the  study  here  in  Section  4  is  extended  to  the  case  where  the  density  jumps 
as  well.  We  study  these  phenomena  both  to  assess  the  effects’  potential  importance  in  underwater 
acoustics,  and  also  as  a  vehicle  for  the  further  development  of  our  formal  structure  in  anticipation  of  the 
development  of  a  PE  suitable  for  modeling  Bragg  scattering  from  a  rough  interface  where  the  density 
jumps.  We  begin  here  in  Section  4  with  the  case  of  a  sound  speed  jump  alone  in  order  to  fully  develop 
our  understanding  of  the  problem  in  this  relatively  simple  context.  In  Section  4.1,  we  once  again  consider 
a  quasi-first-order  theory  that  combines  first  PE  theory  with  the  new  nominally  third-order  term 
introduced  by  the  FW  transformation.  This  term  is  the  one  associated  with  new  tilt  and  curvature-induced 
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effects.  Next,  in  Section  4.2  we  set  the  curvature  of  the  interface  to  zero,  leaving  us  with  an  effect 
associated  with  tilt-induced  smearing.  The  resultant  boundary  conditions  along  a  deterministic  interface 
are  examined  in  detail.  We  find  that  the  relationships  between  some  of  the  boundary  conditions  for  the 
PE  and  their  corresponding  manifestations  in  the  full-wave  problem  are  often  quite  apparent  at  a  specific 
order,  only  to  become  surprisingly  obscure  at  other  orders.  In  Section  4.3,  we  examine  the  stochastic 
problem  and  obtain  the  classical  Lamb  shift  associated  with  a  sound  speed  jump.  Unlike  for  the  cusp,  the 
tilt/curvature-induced  correction  (essentially  smearing  in  the  uprange/downrange  direction)  associated 
with  the  new  FW  term  disappears  from  this  stochastic  problem,  leaving  us  only  with  smearing  in  the 
transverse  direction.  In  Section  4.4,  the  new  physics  associated  with  the  tilt/curvature-induced  correction 
and  with  the  classical  Lamb  shift  is  placed  into  the  context  of  other  physical  phenomena.  In  particular, 
we  see  at  the  end  of  Section  4.4.3  that  tilt-induced  smearing  is  related  to  vacuum  polarization  in  the 
quantum  problem. 

This  prepares  us  for  Sections  5  and  6,  where  the  density  jump  is  introduced  into  our  study.  The 
possible  relevance  in  underwater  acoustics  of  the  classical  Lamb  shift  is  considered  in  Section  5.4,  and  the 
crucial  tools  needed  to  consider  Bragg  scattering  from  a  rough  interface  where  the  density  jumps  are 
considered  in  Section  6.1.  As  discussed  in  Section  6.1.3,  our  study  of  tilt-induced  smearing  will  be 
valuable  primarily  for  the  insights  it  gives  us  in  the  way  that  the  PE  buffers  singularities. 

4.1  A  Simple  Model  for  a  Sound  Speed  Discontinuity  at  the  Interface 

Here,  we  consider  an  interface  where  the  sound  speed  jumps  and  pursue  a  calculation  much  like  that 
for  the  cusp  (where  the  sound  speed  gradient  jumps)  in  Section  3.2.3. 


We  consider  the  standard  two-dimensional  ( x  —  z )  space,  where  (for  now)  the  sound  speed  is  constant 
in  the  half-spaces  I  and  II,  and  the  reference  sound  speed  is  the  sound  speed  in  Region  I.  The  two  regions 
are  separated  by  the  rough  surface  z  =  f  (x) .  The  conventions  for  this  problem  are  summarized  in  Fig. 
12. 


Field  that  carries 
downrange  flux: 


Region  I: 
Typically  water 


Physical  parameters  in  region  I: 
c,  =  sound  speed  in  region  I 


Density  is  constant:  p,  =  p„ 


x+  (downrange) 


f  =  *-L 

1  dx2 


Field  that  carries 
downrange  flux:  %u 


rough  interface  given  by:  z  =  f  (x) 


Region  II: 

Typically  the  bottom 


Physical  parameters  in  region  II: 
c„  =  sound  speed  in  region  II 


Mi  ci 

r-z 


Density  is  constant:  p,  =  pH 


Fig.  12  —  The  conventions  for  a  rough  interface  where  the  sound  speed  jumps.  Regions  I 
and  II  are  separated  by  a  rough  surface  z  =  f(x)  along  which  the  sound  speed  jumps.  As 
before,  jU  =  {l  —  n2^j2  (this  will  be  generalized  later),  where  n=c()/c  such  that  k  =  nk0. 
Here,  c0=  c, .  For  the  moment,  both  regions  are  still  assumed  to  have  the  same  density. 
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4.1.1  The  Boundary  Conditions  for  Quasi-first  Order  Theory 
Let  us  also  use  the  quasi-first  order  Hamiltonian 

H  =  kn  +  A  —  X/Skg  =  k0  +  A  +  ft /$k0  , 


where 


^  ^f1-^)2)/2 = [(!-(/<,  )2)/2]0H+/(x))=^0H+/(x))- 


(64) 


(65) 


Note  that  Eq.  (65)  simply  restates  the  fact  that  //  =  0  in  Region  1  and  f  =  jUu  in  Region  11. 
have 


fi  =  f  M„  S(~z  +  f)  =  f/u„S  ( z-f ) 
P  =  fM„S(z- /)-  f2ju„8'(z- /). 


Now,  we 


(66) 


So  now  we  have 

—  =  — —  jUnS'(z- f)  +  —  jUnS(z-f).  (67) 

8  k0  8  k0  v  ’  8  k0  v  ; 

These  8  -  and  S'  -functions  generate  perfectly  good  Hermitian  contact  potentials.  Now  use  Eq.  (59)  with 
k  —  1  to  get 


8\z-f)x{z)  =  S'(z-f)x{f)-S(z-f)  ^  - 

vZ  z=f 


(68) 


The  equation  of  motion  —id%/dx  =  H%  with  the  Hamiltonian  (Eq.  (64))  becomes 


11X-E!L^’(z.f)x(f)  +  E!LS(z-f)[h(f)  +  f 


2  k0  8  k0 


8  kn 


dZ 

dz 


+  ■■■=0 


z=f  ) 


(69) 


where  the  ellipsis  “  •  •  •  ”  stands  for  terms  that  are  at  worst  steps  and  are  delta  functions  or  their  derivatives. 
As  discussed  in  Section  3.3.4,  we  are  throwing  out  the  step  in  d%/dx  because  it  is  higher  order  in  the 

variables  /,/  and  because  it  is  an  artifact  of  the  unusual  power  counting  convention  implicit  in  the 
quasi-first  order  Hamiltonian  (Eq.  (64).  We  can  use  the  transverse  integrations  given  in  Eq.  (60)  with 
n  —  0  and  in  Eq.  (61)  to  get 


Xi-Xu  =^-f2x{f) 


dZ, 

dXn 

( 

—  8  u 

dz 

Z=f  dz 

*=/  4  l 

fx(f)+f 2?f 

dZ 


z=f  J 


(70) 


respectively.  Note  that  there  is  an  ambiguity  concerning  the  proper  way  to  evaluate  x{f)  ar*d 
PrM.  ,  since  these  quantities  jump  at  the  interface.  The  discontinuity  is  generated  by  5  -  and  S'  - 
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functions  that  are  multiplied  by  f  ’  and  /,  and  so  the  discontinuities  themselves  are  O  (  / ,  f  2  j .  When 
such  discontinuities  are  inserted  back  into  the  8  -  and  S'  -terms,  the  overall  order  of  the  ambiguity 
becomes  O^/4,/2/,/2 ) .  This  is  a  higher  order  than  concerns  us  in  the  present  context.  Thus  to 

o{fj2)  ,  on  the  right-hand  sides  of  Eqs.  (70)  we  can  choose  to  evaluate  X  anywhere  in  the  interval 

Ze\XnXii]’  and  ^Z /dz  anywhere  in  the  interval  d%/dz£  \^Zi Rz->^Zii ■  Anticipating  the 
intuitively  appealing  S  -function  bifurcation  result  (see  footnote  nn,  as  well  as  Section  5.2  and  Appendix 
K),  we  can  choose  to  evaluate  z{f)  and  [d^/rJz]  .  at  the  midpoint  of  the  intervals  (i.e.,  at  the 

averages  denoted  by  a  bar  “ X^Z /dz”).  This  would  leave  us  with 


f  _ 

Xu  =Zi  -—SuZ 

dz„  _  ^Zj  |  Mu  f  y  \  f 2 

dz  dz  4  [J/C 


(71) 


Recall  that  other  conventions  implicit  in  Eq.  (71)  were  summarized  in  Fig.  12.  (In  the  analyses  in  Section 
4.2,  and  then  again  beginning  with  Eq.  (83)  in  Section  4.3),  we  choose  to  simplify  our  work  a  little  bit  by 

selecting  Zi{f)  ar*d  [d^/rJz]  .  rather  than  the  average  values.  To  the  order  that  concerns  us  here, 
this  is  a  perfectly  valid  thing  to  do. 

4.1.2  The  Boundary  Conditions  Only  Conserve  Energy  When  Taken  Together  as  a  Pair 

Our  “toy  model”  provides  us  with  interesting  subtleties  concerning  energy  conservation.  For  example, 
since  8  and  8'  act  just  like  bona  fide  functions,  the  Hamiltonian  (Eq.  (64))  is  Hermitian,  and  so  energy 
must  be  conserved  by  the  resultant  boundary  conditions  (Eq.  (71)).  However,  recall  that  in  deriving  Eq. 
(71),  we  made  the  substitution  Eq.  (68): 

8'{z-f)z{z)  =  8'{z-f)z{f)-8{z-f)  - 

z=f 

These  two  terms  only  correspond  to  a  Hermitian  operator  times  the  field  X  when  taken  together.  Since 
the  S'  -term  gives  the  X  discontinuity,  and  the  8  -term  contributes  to  the  dx/dz  discontinuity,  we  can 
conclude  that  these  contributions  to  the  boundary  conditions  only  conserve  energy  when  taken  together  as 
a  pair.  Similarly,  a  close  examination  of  the  boundary  conditions  (Eq.  (7 1))  reveals  that  the  horizontal 
and  vertical  interfaces  in  a  discretization  conserve  energy  in  concert,  but  not  individually.  To  see  this, 
note  that  a  boundary  condition  of  the  general  form  X  aZ  cannot  conserve  energy  along  a  vertical 
interface,  since  X  is  basically  a  carrier  of  the  downrange  flux.  The  missing  energy  must  be  made  up 
along  the  horizontal  interface.  This  is  acceptable,  since  the  vertical  and  horizontal  interfaces  are  artifacts 
of  discretization,  and  our  formalism  actually  applies  to  a  sloping  interface. 
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4.2  The  Deterministic  Interface  and  a  Look  at  How  Boundary  Conditions  “Migrate” 
with  Increasing  Order 

4.2.1  A  Deterministic  Interface  with  Tilt,  but  No  Curvature 

To  explore  the  meaning  of  the  contribution  of  the  new  term  /l/Sk g  to  the  behavior  of  the  PE  near  an 
interface,  where  the  sound  speed  (but  not  yet  the  density)  jumps,  let  us  consider  an  interface  with  tilt 
( /  4=  0 ),  but  no  curvature  ( /  =  0 ).  As  mentioned  in  Section  2.3,  here  there  is  no  backscatter,  and  so  the 
PE  should  model  the  full-wave  result  to  arbitrary  precision.  Thus,  we  should  be  able  to  find  a  fairly 
straightforward  explanation  for  all  aspects  of  the  PE  result. 

4.2.1.1  Tilt-induced  Boundary  Conditions  are  Difficult  to  Interpret  in  Quasi-First-Order  Theory 

We  begin  with  the  simple  model  as  in  Eq.  (6) 

H  =  k0  +  A- Aj  8  =  k()  +  A  +  jl k^ 


and  note  that  for  a  tilted  interface  without  curvature,  the  resultant  boundary  conditions  (Eq.  (13))  now 
lead  to  the  boundary  following  conditions 


x„=y 

*Xu  _ 

dz 


f 


f 

1  4  ^ 


X, 


(72) 


(Here  we  are  exploiting  the  fact  that  %  and  d%/dz  are  continuous  at  0th  order  in/.) 

At  first  glance,  the  O  ( / 2 )  terms  contributed  by  the  new  term  jj.  / Sk 2  do  not  seem  to  make  any  sense. 

To  be  specific,  these  boundary  conditions  (Eq.  (72))  are  a  step  in  the  wrong  direction  compared  with  the 
boundary  conditions  that  emerge  from  the  simplest  possible  Hamiltonian  H  =  k0  +  A .  For  this 

Hamiltonian,  we  can  use  infinitesimal  integrations  to  show  that  %  and  3y//3z  are  continuous.  Thus, 
d%/dx  and  so  H %  are  continuous,  and  so  to  0(A) ,  so  is  A ,  and  consequently  so  is  the  0(  A) 

expansion  of  H  .  This  gives  us  continuity  of  A  and  3A/3z ,  where  A  solves  the  Helmholtz 
equation.  So  far,  this  is  exactly  what  we  would  expect. 

Now,  note  that  continuity  of  H also  gives  a  boundary  condition  on  and  so  also  on  Vj-A . 

Note  that  the  boundary  condition  on  V)-X  does  not  involve  /  ,  and  the  transformation  connecting  %  to 
A  only  involves  A%  and  ^ ,  so  it  cannot  introduce  a  dependence  on  / 2 ,  nor  can  a  further  operation  by 
the  Vj,  operator  to  take  us  from  an  equation  connecting  %  <->  A  to  one  connecting  <->  V^A . 
Thus,  the  boundary  conditions  on  V\x  and  V^A  are  the  same  as  for  a  flat  z  =  0  interface,  noting  in 
particular  that  there  is  no  O  (/2 )  connection.  In  Appendix  H,  it  is  shown  that  the  boundary  condition  on 
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VjA  should  in  fact  contain  a  term  that  goes  as  / 2 .  This  is  an  indication  that  the  O  [  /'2  j  term  that  leads 

to  boundary  conditions  (Eq.  (72))  is  ultimately  somehow  related  to  the  boundary  condition  on  V2TA  rather 
than  to  those  on  A  and  dA/dz  ■ 

Before  exploring  this  further,  let  us  note  a  very  interesting  aspect  to  the  result  in  Appendix  H.  The 
term  proportional  to  0  (. /  )  back  in  Eq.  (71)  (which  is  set  to  zero  in  Eq.  (72))  is  apparently  associated 

with  the  implicit  boundary  condition  on  d3A/dz3  ■  This  0(/)  term  induces  physical  effects  that  are 
implicit  in  full-wave  theory,  but  have  to  be  made  explicit  in  the  context  of  the  PE°°.  The  new  physical 
effect  introduced  by  the  O  (f\  term  appears  to  involve  coupling  to  interface  waves.  These  interface 

waves  are  briefly  discussed  in  Appendix  H,  and  the  effect  is  be  examined  again  in  Section  5  and  in 
Appendices  N.2  and  O.  For  the  moment,  it  is  valuable  to  develop  a  better  understanding  of  the 

comparatively  straightforward  O  (/2 )  terms  without  having  to  become  enmeshed  in  these  issues.  That  is 

the  reason  we  concentrate  on  a  surface  without  curvature  here. 

Before  seeking  the  needed  O (/2)  contribution  to  the  boundary  conditions  in  the  exotic  new  terms 
generated  by  the  FW  transformation,  let  us  first  eliminate  the  conventional  higher-order  terms  in  the  PE 
(i.e.,  those  that  come  from  expanding  the  square  root  operator  yj  1  +  2  A /k0  )  as  possible  sources  for  the 
missing  contribution  to  the  boundary  conditions.  Considering  the  0(A2  j  Hamiltonian 
H  =  k0  +  A  —  A2 1 2k0  ,  we  can  use  the  standard  infinitesimal  integrations  to  find  that  %  and  /dz  are 
constant,  and  that  the  boundary  conditions  on  and  V\x  come  from  8  -  and  S'  -functions 

generated  by  cross-terms  in  A2 .  These  are  exactly  the  same  boundary  conditions  as  those  for  a  flat 
horizontal  surface  ( /  =  0 ),  and  there  is  no  source  for  O  (/2 )  terms  in  the  boundary  condition  for  . 

The  transformation  from  the  auxiliary  field  %  back  to  the  actual  pressure  field  A  involves  powers  of  A 
(the  transformation  is  performed  an  infinitesimal  distance  on  either  side  of  the  interface),  and  again  there 
is  no  source  for  terms  explicitly  proportional  to  / 2 . 

4.2.1.2  Using  C)(/P,  Aj  Theory  to  Understand  the  Tilt-induced  Boundary  Conditions 


Now,  let  us  add  the  new  FW  —  A/S k$  =  jU/SkQ  term  to  the  second  order  Hamiltonian  to  get 


H  =  k0+ A 


AL 

2k0 


A 

8^o  * 


00  This  is  somewhat  similar  to  backscatter  as  discussed  in  Appendix  B.  Note  that  /  is  the  curvature  of  the  surface, 
and  in  a  sense  (for  the  shallow  grazing  angles  that  we  are  implicitly  restricting  ourselves  to  here,  and  operating 
within  the  PE  picture  based  on  small  perturbations  from  range  independence)  backscatter  is  basically  induced  by  the 
curvature  of  the  surface,  and  not  the  tilt.  (Recall  that  with  shallow  tilt  and  grazing  angles,  an  interface  with  tilt,  but 
no  curvature,  will  not  backscatter.) 
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This  qualitatively  recovers  the  expected  behavior.  The  term  proportional  to  jS'(z-f)  generates  a 

jump  in  V\x  ■  but  infinitesimal  integration  reveals  that  there  is  no  jump  in  X  and  ^X / dz  (we  would 
need  terms  proportional  to  a  8"'  and  8",  respectively,  to  generate  jumps  in  these  quantities).  Expanding 

the  so-called  WKB  operator  H  ^  =  (1  +  2A/ k(]  )  4  to  0(A2J ,  and  using  it  to  convert  from  X  back  to 

A  on  either  side  of  the  interface,  there  are  cancellations  between  the  implicit  jump  in  V\x  and  the 
explicit  one  in  V\%  so  that  A  and  dA/dz  remain  continuouspp.  Using  the  usual  infinitesimal 
integration,  it  is  easy  to  show  that  to  within  a  factor  of  2,  the  boundary  condition  on  V/X  that  is 
proportional  to  / 2  is  indeed  like  that  expected  for  V2TA  (and  given  in  Eq.  (H.9))  Vj.A  depends  not  only 

on  VjX  but  also  on  V\x  and  ^tX  •  and  so  the  precise  relationship  between  Vj,A  and  V^X  's  quite 
complicated.  A  rigorous  as  opposed  to  qualitative  rederivation  of  Eq.  (H.9)  would  involve  much  more 
effort  than  is  warranted  here.  For  our  current  purposes,  it  is  enough  that  we  have  been  able  to  demonstrate 
qualitatively  that  the  new  FW  terms  are  indeed  needed  to  reproduce  the  boundary  conditions  for  V2  A 

that  properly  include  a  component  that  is  proportional  / 2  (as  in  Eq.  (H.9)). 

4.2.1.3  A  More  Informed  Second  Look  at  Quasi-First-Order  Theory 

Now,  if  we  drop  the  “/l2/2k()  ”  term,  and  just  use  the  quasi-first-order  Hamiltonian 


then  the  formalism  retains  a  memory  of  the  jump  in  V^X  •  albeit  in  modified  form.  Recall  that  the  quasi- 
first  order  Hamiltonian  generates  jumps  in  X  and  dx / (Eq-  (70)) 


Xi  Xn 

dl,  dl„ 
dz  dz 


f 

~—MnZi  =0 

f2 

+  ^MuZ,=  0 


(73) 


pp  Here  we  demonstrate  that  the  cancellations  needed  so  that  A  and  dA/dz  like  X  and  dx/dz  are  continuous  in 
0(T2,T)  theory.  First  consider  the  artificial  problem  where  the  interface  is  flat,  but  we  insert  by  hand  the  O ( / 2  j 
boundary  condition  for  the  fixed  value  of  /  (recall  we  are  here  considering  the  tilted  interface  with  a  constant 
slope).  This  range-independent  problem  is  now  an  eigenvalue  problem.  Decompose  X  into  eigenvectors  xm  >  and 
use  x,„  continuity  to  show  that  //  =  A  is  continuous.  Thus,  for  this  case,  the  boundary  conditions  on  in 

VlxXz,  and  X  must  lead  to  algebraic  identities  that  conspire  to  produce  cancellations  such  that  A  (and  similarly 
dA/dz )  are  continuous.  Now,  restore  a  true  tilt  and  show  the  same  algebraic  identities  still  apply.  The  boundary 
conditions  on  V  /  x  ,  V/-X ,  dx / dz, ,  and  X  must  all  remain  the  same,  since  these  all  come  from  S  -functions  in  the 
Hamiltonian  (or  in  the  case  of  continuity  conditions,  the  lack  thereof).  As  always,  we  can  use  tangential 
differentiation  to  show  that  continuity  of  X  and  dx/dz  imply  continuity  of  dx/dx ,  which  via  the  PE  implies 
continuity  of  H x  ,  and  this  also  fixes  the  boundary  condition  on  V/x  ■  These  fix  the  algebraic  relations  between 
Vatx  ,  V\X'  and  X  '  and  we  already  know  that  these  lead  to  cancellations  in  the  0(2 2  j  transformation  from  ^  to  A 
that  produce  the  continuity  of  A. 
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If  we  take  the  tangential  derivative  (oc  d/dx  +  /  d/dz )  of  the  first  equation  in  Eq.  (73),  we  have 
(recalling  /  =  0 ) 


dl, 

dx 


_ A  _ 

dx  4  dx  {  dz  dz 


dl, 


dZ„),f Ml  =  0 

A  Mil  -S  J 
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dz 


Using  the  second  boundary  condition  and  as  always  retaining  only  terms  to 
rescaling 


( 


V 


dx 


dZn  ' 

dx 


0(Dqq,  we  have  the 


Substituting  back  into  the  PE,  it  follows  that  H%  obeys  the  same  rescaling  and  we  now  have 


f  i  2  ^ 


1  J—n 
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VtZi  ^ tZ ii  2k0  HuZii 


or 


f 

VtZi  -VtZu  —  4  MuVtZ,  —2k0/JnZir 


Again  noting  that  H  ^ X  —  A ,  and  more  specifically  that  in  quasi-first-order  theory  this  gives  us  in  the 
half-spaces  A  =  H  x  =  ( k0  -  A/ 2 )  X  -  we  can  see  that  A  (and  its  transverse  derivative)  picks  up 

similar  behavior.  We  no  longer  have  a  boundary  condition  similar  in  form  to  Eq.  (H.9).  We  have 
discovered  that  quasi-first-order  theory  picks  up  jumps  in  X  and  A  to  compensate  for  modifications  to 

the  jumps  in  V2rX  and  V^A . 

In  other  words,  quasi-first-order  theory  is  characterized  by  jumps  in  X  an^  A  that  do  not  appear  in 
the  full-wave  theory  -  these  jumps  are,  however,  placeholders  for  a  bona  fide  physical  effect  that  is 
present  in  full-wave  theory.  In  that  sense,  the  effect  is  real  and  must  be  taken  seriously.  The  same  is  true 

for  the  cusp,  and  the  atomic  Lamb  shift.  For  example,  for  the  cusp,  the  tilt -generated  (i.e.,  /  )  contact 

potential  is  really  related  to  a  jump  in  V^A ,  and  so  the  boundary  condition  on  V7  A  that  was  obtained 
earlier  is  a  residual  of  this  effect.  A  close  look  at  the  derivation  of  the  traditional  (transverse)  Lamb  shift 
(i.e.,  the  one  proportional  to  fcg(/2))  also  reveals  that  it  is  related  to  a  discontinuity  in  the  third 

derivative  V^-X  ■  Nevertheless,  we  know  from  quantum  mechanics  that  the  predicted  contact  potential 
indeed  leads  to  an  observable  physical  effect,  even  when  inserted  into  the  standard  (nonrelativistic) 
Schrodinger  equation.  This  teaches  us  that  the  artificial  boundary  conditions  of  quasi-first-order  theory 
can  be  productively  used  to  model  the  legitimate  physical  result. 


qq  Recall  that  at  0(  /'  ;  j ,  the  quasi-first-order  model  would  begin  to  pick  up  very  problematic  5  -functions  in 
dz/dx . 
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4.2.2  Migrating  Boundary  Conditions 

4.2.2.1  Changing  Boundary  Conditions  as  the  Lead  Order  of  the  PE  Changes 

The  interpretation  of  a  contact  potential  in  terms  of  induced  boundary  conditions  depends  on  its 
context.  For  example,  consider  what  happens  as  we  modify  the  order  p  (p  =  m/2  following  the  notation 

of  Section  3.3.2)  of  the  expansion  of  the  square  root  operator  yl  +  2X/k0  .  Then,  rather  obviously,  the 
leading-order  term  A,p  in  the  Hamiltonian  II  changes,  and  with  it  the  leading  order  derivative  of  the 
field  V2tp % j (2k{)y  .  We  have  just  seen  in  Section  4.2.1  that  as  this  happens,  a  given  fixed  contact 
potential  will  induce  different  boundary  conditions.  In  particular,  we  saw  how  the  boundary  conditions 
induced  by  a  S'  -function  potential  morphed  from  discontinuities  in  V\x  and  ^\X  only  hi  to  a  different 
and  somewhat  larger  set  of  discontinuities  that  now  also  includes  the  discontinuity  of  the  wave  function 
X  itself  and  its  first  derivative.  The  implications  of  this  statement  are  illustrated  in  Fig.  13. 


The  field  %  as  a  function  of  transverse  coordinate  z 


X 

1st  order  w/tilt:  H  =  k„  +  A - 

8  K 
X2  X 

2nd  order  w/tilt:  H  =kn+ A - - 

2  k0  8  k20 

A2 

2nd  order  w/o  tilt:  H  =kn+ A - 

2  K 


Fig.  13  —  The  FW  transformation  introduces  a  term  proportional  to  A  into  the  Hamiltonian  H.  This  term  becomes  a 
5'  -function  along  a  tilted  interface  where  the  sound  speed  jumps.  In  the  context  of  second  order  (i.e.,  0(A2  j ) 
theory,  this  tilt-induced  contact  potential  introduces  an  expected  jump  in  boundary  conditions  for  the  second 
derivative  of  the  auxiliary  field  %:  d2%/dz2  .  In  the  context  of  first  order  (i.e.,  0(2))  theory,  this  tilt-induced 
effect  changes,  and  partly  becomes  a  jump  in  the  auxiliary  field  %  .  The  behavior  of  the  corresponding  full  acoustic 
field  A  would  be  similar.  Note  that  the  sketch  is  an  illustration  of  the  concepts  involved,  and  not  an  actual  numerical 
calculation.  (The  numerical  calculation  of  these  concepts  will  be  a  topic  for  follow-on  research.)  For  the  purposes 
of  comparison,  the  field  behavior  with  the  A  term  entirely  removed  is  sketched  as  well. 


The  S'  -function  potential  discussed  in  Section  4.2.1  and  in  Fig.  13  results  when  the  new  FW  term  was 
evaluated  at  an  interface  where  the  sound  speed  jumps.  For  the  case  of  the  cusp  (i.e.,  a  jump  in  the 
gradient  of  the  sound  speed;  see  Section  3.2.3),  replace  the  field  in  Fig.  13  with  its  transverse  derivative 
Xyz)  — >  dx/dz  ■  Recall  that  a  similar  contact  potential  occurs  in  the  stochastic  problem,  where  it  is 
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called  a  “Lamb  shift.”  For  the  classical  Lamb  shift  associated  with  a  sound  speed  cusp  (see  Section  3.2.3), 
replace  the  average  field  with  its  derivative  (jjf)(z)  — >  d\%)/dz  ,  and  for  the  “toy  model”  of  the  atomic 

Lamb  shift  (see  Section  3.2.4),  replace  the  stochastic  field  in  Fig.  13  with  its  gradient  (tp'j  — >  V Up) . 

Note  that  in  higher-order  theory,  a  contact  potential  acting  at  a  point  leads  to  a  nonlocal  change  in  the 
field  that  can  be  interpreted  as  a  diffuse  cloud.  Specifically,  the  higher-order  boundary  conditions  are 
changed  in  a  way  such  that  the  wave  function  is  “launched”  differently.  This  modifies  the  behavior  of  the 
field  not  just  at  the  one  point  where  the  contact  potential  is  located,  but  also  in  the  vicinity  of  the  point. 
The  difference  between  the  field  with  and  without  the  contact  potential  present  can  be  interpreted  as  a 
cloud  in  the  vicinity  of  the  contact  potential.  Thus,  we  see  that  this  physically  real  cloud  is  ultimately 
touched  off  by  boundary  conditions  on  higher-order  derivatives,  which  are  implicit  in  full  wave  theory, 
and  explicit  in  higher-order  PEs.  At  lower  order  theory  (say  in  the  world  of  the  nonrelativistic 
Schrodinger  equation),  the  contact  potential  induces  a  local  field  discontinuity  in  lieu  of  the  effect  of  the 
diffuse  cloud.  This  does  not  look  much  like  a  cloud,  but  it  provides  a  model  of  sorts  for  the  effects  related 
to  the  physically  real  cloud. 

4.2.2.2  General  Principles  Regarding  the  Migration  of  Boundary  Conditions 

Let  us  now  draw  a  general  conclusion  based  on  the  set  of  examples  we  have  just  considered.  We 
stipulate  that  boundary  conditions  are  in  reality  contact  potentials,  and  then  we  take  the  consequences  of 
that  statement  seriously.  These  assertions  mean  that  as  we: 

■  convert  the  boundary  conditions  into  contact  potentials, 

■  add  terms  that  increase  the  order  of  the  lead  derivative  in  the  differential  equation, 

■  translate  the  contact  potentials  (e.g.,  8  -functions)  back  into  explicit  boundary  conditions  on  the 
wave -function  and  its  derivatives, 

>  then  we  find  that  the  given  phenomenon  at  the  interface  now  influences  the  solution  via  the 
boundary  conditions  on  higher-order  derivatives  than  was  previously  the  case.  In  other  words,  it 
migrates  up  to  higher-order  transverse  derivatives  of  the  wave  function. 

The  converse  happens  if  we  remove  the  leading  order  derivative  in  a  differential  equation: 

>  a  given  effect  will  now  induce  explicit  boundary  conditions  on  lower-order  derivatives  of  the 
solution  to  the  wave  equation.  In  other  words,  it  migrates  down  to  lower-order  transverse 
derivatives  of  the  wave  function. 

Thus,  boundary  conditions  migrate  as  leading  order  derivatives  are  added  or  subtracted. 

In  the  discussion  just  concluded  here  and  in  the  previous  subsection,  we  considered  an  effect  that 
occurs  naturally  in  O  (^2)  theory,  and  migrated  down  to  the  O  (A'j  theory.  Specifically,  when  the 

O  (^2)  PE  is  used,  the  effect  spawns  associated  boundary  conditions  that  are  obviously  related  to  the 
corresponding  boundary  condition  for  the  solution  of  the  Helmholtz  equation:  there  is  a  discontinuity  in 
the  second  derivative  of  the  wave  function:  .  However,  the  effect  migrates  down  to  the  O(A) 

theory  in  a  very  nonintuitive  way:  for  example,  there  is  now  also  a  discontinuity  in  the  wave  function  % 
itself.  However,  we  saw  that  the  O  (/l)  theory  retains  physical  significance. 
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Later  in  Section  6.1  as  we  examine  density  jumps,  we  come  across  an  effect  that  naturally  migrates  up 
from  O  [A)  theory  to  the  O  (A2  j  (and  higher)  PE.  This  turns  out  to  be  the  most  important  application  of 

the  concept  of  migrating  boundary  conditions,  so  it  is  worthwhile  to  take  a  moment  to  foreshadow  the 
following  result:  The  known  attributes  of  the  solution  to  the  (full-wave)  Helmholtz  equation  suggest  that 
the  wave  function  % ,  which  is  a  carrier  of  downrange  flux,  should  be  discontinuous  along  a  tilted 

interface  where  the  density  jumps.  This  discontinuity  should  already  be  apparent  in  O  ( A)  theory. 
Nevertheless,  the  discontinuity  in  %  is  automatically  forced  upward  into  the  (9  ( 9 2  j  theory  (and  higher 
orders)  by  the  structure  that  the  FW  procedure  imposes  on  the  PE.  In  this  context,  the  discontinuity  in  % 
automatically  migrates  up  to  become  a  discontinuity  in  its  second  derivative:  V\x  ■  The  needed 
discontinuity  in  %  can  only  be  placed  into  O  ( A )  theory  by  hand,  and  then  only  in  an  incomplete  form 
that  is  not  entirely  consistent  with  the  other  physical  demands  on  the  theory. 

4.3  Sound  Speed  Discontinuity  Along  a  Stochastic  Interface:  The  Lamb  Shift  for  a 
Sound  Speed  Jump 

Here  the  calculation  for  a  sound  speed  cusp  given  in  Section  3.2.3  is  adapted  to  calculate  the  effective 
boundary  conditions  induced  by  a  sound  speed  discontinuity  along  a  stochastic  rough  interface.  Since  it 
both  reinforces  many  of  the  basic  insights  obtained  in  Section  3.2.3  and  additionally  provides  insights  into 
the  core  issue  of  our  study  -  incorporating  discontinuities  of  the  environmental  parameters  into  the  PE  - 
the  calculation  is  once  again  explicitly  considered  here  rather  than  being  relegated  to  an  appendix  as  are 
some  comparable  calculations  later  in  this  study. 


4.3. 1  The  Effective  Boundary  Conditions  for  the  Sound  Speed  Jump 

Begin  with  the  boundary  conditions  (Eq.  (71)).  Now  follow  the  stochastic  procedure  outlined  in 
Section  3.2.3.  Once  again,  convert  the  boundary  conditions  at  z  =  f  to  effective  boundary  conditions 
down  at  z  =  0  by  performing,  independently  on  each  of  the  two  sides  of  the  interface,  Taylor  series 
expansions  of  the  function  X  and  its  derivative.  Incorporating  boundary  conditions  (Eq.  (71)),  we  have 
an  updated  version  of  Eq.  (32): 
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whereas  for  the  cusp,  we  used  the  boundary  condition  (Eq.  (71))  on  dx /dz  ,  but  now  it  is  nonzero  at  first- 
order  in  /  .  Similarly,  we  see  that  X  is  continuous  at  the  surface  z  =  f  to  O  (  /  2  j ,  so  the  tangential 
derivative 
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is  continuous  to  0(/  ^ .  Since  d%/dz  is  continuous  to  0  f  /  ) ,  d%/dx  at  the  surface  is  continuous  to 
0(/2,/-/),  and  from  the  PE,  so  must  H X  ■  Evaluating  H I%1  —  Hn%n  =  0 ,  we  have  to 

Of./",/' •/) 
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Similarly,  modifying  Eq.  (35)  we  find: 
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Again  adapting  the  technique  originally  developed  by  Kuperman  for  a  slightly  different  context  [65], 
break  the  wave  function  into  coherent  and  incoherent  parts:  z\—0  =  {z)  (0)  4  &X  -  ar*d  average  these 
boundary  conditions.  This  gives  us 
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Note  that  quite  generally,  +  =  0,  so  the  downrange  Lamb  shift  for  a  sound  speed  jump 

disappears  immediately  from  Eq.  (78). 

4.3.2  Converting  the  Smearing  Term  to  Familiar  Form 

Now,  let  us  look  at  the  smearing  term  in  the  boundary  condition  for  d  (x) /dx  given  by  the  last  part  of 

Eq.  (78).  This  term  involves  the  boundary  condition  for  the  third  derivative  d  '  (x)/dx3 .  To  obtain  this, 

flatten  the  interface  so  that  f  —  0,  and  take  the  transverse  derivative  Vr=d/dz  of  the  stochastic 
Helmholtz  equation  on  either  side  of  the  interface: 
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and  evaluate  the  second  equation  an  infinitesimal  distance  on  both  sides  of  the  interface  and  subtract.  So 
far,  Eq.  (79)  is  the  same  as  Eq.  (38).  However,  here  we  are  assuming  that  the  sound  speed  is  constant  in 

the  half  spaces,  so  Vr  [//]  (z)  =  0  away  from  the  interface.  On  the  other  hand,  [//]  now  jumps  at  the 

interface.  Also,  to  O  (/°) ,  Vr  is  still  continuous,  so  subtracting  Eq.  (79)  evaluated  just  inside  the  two 

regions  gives  us 
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4.3.3  Terms  Associated  with  Diffuse  Scattering 

Note  that  unlike  for  the  cusp  (e.g.,  Eq.  (40)),  there  are  now  lossy  diffuse  (Bragg)  scattering  terms 

(proportional  to  //  8%^  and  (fS%j)  in  the  boundary  conditions  for  the  coherent  field.  Although  they 

will  be  ignored  for  the  moment  so  that  we  may  concentrate  on  the  purely  coherent  effects,  these  terms 
provide  a  window  into  a  very  interesting  set  of  physical  processes.  They  both  contribute  an  overall  energy 
sink. 


§X  is  typically  proportional  to  /  ,  and  it  follows  that  the  term  proportional  to  8%j  is  proportional 

to  (/2)  •  This  term  corresponds  to  the  well-known  Bragg  scattering  phenomenon  associated  with  the 

first-order  perturbation  result  of  rough  surface  scattering  theory  (for  an  general  overview  of  first-order 
perturbation  theory  and  Bragg  scattering,  see,  for  example,  Ref.  73).  S.  McDaniel  presents  a  nice 

discussion  bringing  the  Bragg  scattering  term  lf8%\  (for  a  rough  surface  where  the  sound  speed  jumps) 

into  the  context  of  the  PE  (Ref.  74,  Section  II.A).  As  in  the  discussion  here,  she  bases  her  discussion  on 
the  normal-mode  work  of  Kuperman  [65]. 

The  term  proportional  to  (fS%J  is  particularly  interesting.  It  describes  curvature-induced  diffuse 

Bragg  scattering,  and  again  noting  that  8%  is  typically  proportional  to  / ,  this  type  of  incoherent 

scattering  phenomenon  is  proportional  to  [j  •  f^j  =  —  .  This  quantity,  the  mean  square  of  the  slope, 

is  related  to  the  need  for  renormalization  in  scattering  theory.  This  very  interesting  topic  is  explored  a 
little  further  in  Section  4.4. 


4.3.4  Pure  Coherent  Field  Effects 

4.3.4.1  The  Effective  Boundary  Conditions  on  the  Coherent  Field 

Until  then,  however,  we  will  drop  the  diffuse  scattering  terms  and  concentrate  on  the  propagation  of 
pure  coherent  field  effects.  This  is  not  unlike  dropping  the  diffuse  scattering  contribution  from  volume 
scattering  surrounding  (but  not  right  on  top  of)  the  sound  speed  cusp  of  Section  3.2.3  or  the  1/ r  potential 
in  the  atomic  problem  of  Section  3.2.4,  and  calculating  only  the  coherent  radiation.  We  also  need  to  keep 
in  mind  that  the  coupling  between  coherent  and  incoherent  radiation  should  be  relatively  modest,  even  if 
previous  incoherent  scattering  has  produced  a  great  deal  of  incoherent  radiation.  To  see  why  this  is  true, 

note  that  in  (^f 8%^j  and  8%^ ,  /  and  /  are  only  correlated  to  the  locally  generated  part  of  the 

incoherent  field  8x  (he.,  that  having  been  generated  within  a  distance  of  the  correlation  length  of  the 

interface),  and  not  that  generated  by  the  entire  surface.  The  local  part  of  the  incoherent  field  Sx  should 
thus  remain  relatively  modest,  and  the  Bragg  terms  should  never  overwhelm  the  rest  of  Eq.  (81).  Keeping 
all  this  in  mind,  we  eliminate  the  diffuse  scattering  terms  from  Eq.  (81),  and  are  left  with 


(Zi >(0) - (Xu){ 0)  =  -kl  (f2)juu  (x)(0)  +  o(f) 

d(x,)\ 


dz 


d{x„) 


z=0 


dz 


z=  0 


+ 


0(f) 


z= 0 


(82) 
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Once  again,  note  that  the  downrange  Lamb  shift  for  an  interface  has  disappeared  for  an  interface  with  a 
sound  speed  jump,  and  we  are  only  left  with  the  traditional  (transverse)  Lamb  shift  phenomenon. 

For  our  subsequent  analysis,  it  will  be  useful  to  take  advantage  of  the  fact  that 

C x)={x,)={x„)+o((f 2» 


to  rewrite  Eq.  (82)  as 


{x„)fi) = (l + K  (f2)l/n  ){x,  )(o) 


d(Xu) 


dz 


=(i -K(f2)vn) 


d{x.) 


z= 0 


dz 


z= 0 


(83) 


(Note  that  the  conventions  implicit  in  Eq.  (83)  are  illustrated  in  Fig  12.  Most  relevantly,  recall  that  the 
curve  z  =  f  (x)  separates  region  I  from  region  II,  that  the  reference  sound  speed  is  the  one  in  region  I, 

and  fl  =  {\  —  c\jc1^J'l.  This  implies  that  //,  =  0  and  )J.U  =  (l  —  c] / c2n  )/?■ .  The  positive  z  axis  points 
from  region  II  into  region  I.) 

4.3.4.2  Understanding  the  Coherent  Field  Boundary  Conditions 

An  interesting  interpretation  for  both  Eq.  (83)  and  the  earlier  result  (Eq.  (40))  is  illustrated  in  Fig.  14. 
A  sound  speed  jump  along  a  stochastic  rough  interface  looks  like  a  dipole  sheet,  while  a  jump  in  the 
sound  speed  gradient  along  the  same  surface  looks  like  a  monopole  (charge  sheet). 


»  -  +  c  +  ... 

Fig.  14  —  In  the  stochastic  problem,  a  rough  interface  decomposes  into 
contributions  that  look  like  a  monopole  sheet,  a  dipole  sheet,  etc.  A  jump  in  the 
sound  speed  gradient  generates  the  former,  while  the  dipole  is  generated  by  a 
jump  in  the  sound  speed  itself. 

Finally,  note  that  Eq.  (83)  indicates  that  the  stochastic  wave  function  (x)  is  discontinuous  at  the 

horizontal  interface  separating  the  two  regions.  This  boundary  condition  is  perfectly  compatible  with  the 
straightforward  eigenvalue  problem.  To  justify  this  general  assertion,  use  the  generic  field  %  rather  than 

the  stochastic  field  (%\ ,  and  assume  that  —id%  fix  =  H%  for  range -independent  II  and  that  Xs  ~  aXn 

along  a  horizontal  line  (say  z  =  0).  Now  Xi  ~aXu  =  0  along  the  line,  and  the  same  is  true  for  the 
tangential  derivative 

0  =  d  (Xi  -  aXn  )!dx  =  dXi  fix- a d%n  fix  . 

In  other  words,  %  and  dx fix  jump  in  tandem.  Recalling  that  a  given  eigenfunction  X„  obeys  the 
eigenvalue  equation  X„  =  K  fiXnfix)  >  note  the  tandem  jumps  in  X„  ail(3  3 Xnfix  are  crucial  if  this 


72 


Daniel  Wunnser 


equality  is  to  hold  at  the  interface.  While  this  may  seem  obvious  in  hindsight,  a  failure  to  recognize  this 
result  can  lead  to  considerable  confusion  as  we  pursue  our  analysis. 

4.4  The  “New  Physics”  in  Context 

The  new  physics  discovered  so  far  is  placed  into  a  broader  physical  context  in  Fig.  15.  In  this  section, 
we  explore  some  of  the  implications  of  the  figure. 


Fig.  15  —  The  classical  Lamb  shift  in  context.  The  PE  (cf.  the  Schrodinger  equation  in  atomic  physics)  is  used  to 
propagate  the  field  (cf.  the  electron  wave  function),  often  in  the  vicinity  of  a  rough  interface  (cf.  the  world  line  of  the 
nucleus  advected  by  vacuum  fluctuations).  Along  the  interface,  the  sound  speed  (or  more  generally  any 
environmental  parameter)  or  its  transverse  derivative  (in  atomic  physics,  the  gradient  of  the  potential)  is 
discontinuous.  It  is  well  known  that  the  rough  interface  couples  eigenstates  (or  modes  to  use  the  terminology  of  a 
classical  wave  guide).  The  most  pronounced  manifestation  of  this  is  Bragg  scattering.  The  stochastic  problem  also 
generates  a  shift  of  the  eigenstates  (and  associated  eigenvalues)  in  addition  to  the  mixing  of  eigenstates.  This  is  the 
Lamb  shift.  More  obscure  is  the  fact  that  the  slope  and  curvature  of  such  interfaces  spawn  a  new  class  of  effects  (for 
the  atomic  problem,  the  slope  of  the  world  line  corresponds  to  the  velocity  of  the  nucleus  as  it  is  advected  by 
vacuum  fluctuations,  and  the  curvature  is  the  acceleration).  For  instance,  tilt  buffers  the  interface  in  a  way  that  is 
very  reminiscent  of  the  Lamb  shift,  but  it  is  automatically  generated  by  the  deterministic  formalism  without  the  need 
for  stochastic  averaging.  The  formalism  built  up  also  suggests  a  method  for  modeling  an  interface  characterized  by  a 
density  jump.  Since  the  Lamb  shift  is  a  true  rough  surface  effect,  the  proven  success  of  the  method  in  modeling  this 
class  of  effects  validates  its  use  in  modeling  a  rough  interface  where  the  density  jumps.  As  discussed  in  Section  6.1, 
the  formalism  buffers  a  density  jump  much  like  it  buffers  a  tilted  interface.  The  aspects  of  the  problem  most  closely 
examined  in  the  current  paper  are  shaded. 
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4.4.1  The  Classical  Lamb  Shift  in  Context 


Note  that  the  Lamb  shift  emerges  from  the  stochastic  problem.  The  transverse  (or  traditional)  Lamb 
shift  is  proportional  to  the  mean  square  surface  height  (/2)  ^  and  it  is  a  result  of  smearing  of  the  interface 

in  the  transverse  direction.  This  is  a  purely  coherent  field  effect,  and  as  such  it  involves  mode  shifts 
rather  than  mode  mixing.  The  corresponding  eigenvalues  for  the  modes  (the  modes  are  eigenfunctions) 
shift  as  well.  This  is  noted  in  the  classic  quantum  mechanics  textbook  by  Cohn-Tanoudji  et  al.  [75], 
where  it  is  pointed  out  that  there  are  in  fact  two  types  of  effects  associated  with  the  time-dependence 
introduced  by  vacuum  fluctuations  (or  similarly  range-dependent  roughness  in  the  acoustic  waveguide 
problem): 

•  The  mixing  of  eigenstates  producing  transition  probabilities  and  characterized  by  time -dependent 
perturbation  theory.  (This  corresponds  to  Bragg  scattering  described  by  the  Born  series  in  the 
acoustic  scattering  problem.) 

•  The  Lamb  shift  modifying  eigenstates  —  i.e.,  time-averaging  creates  a  new  stochastic  problem 
that  is  effectively  static  (time-independent).  (For  the  acoustic  problem,  replace  “time”  with 
“range”  in  this  statement.) 

In  the  deterministic  problem,  a  similar  effect  is  introduced  by  surface  tilt.  Once  again  the  interface  is 
smeared,  this  time  by  the  shielding  (or  buffering)  by  a  cloud  of  virtual  particle  pairs  (i.e.,  bound  uprange- 
downrange  oscillations).  The  FW  transformation  automatically  generates  this  without  the  need  for 
explicit  stochastic  averaging.  This  forms  an  important  precedent,  which  will  help  us  understand  how  the 
PE  deals  with  Bragg-scale  vorticity,  an  effect  that  occurs  along  an  interface  where  the  density  jumps  (see 
Section  6.1).  Sometimes,  tilt-induced  buffering  survives  into  the  stochastic  problem.  This  is  called  the 
“downrange  Lamb  shift,”  because  it  is  caused  by  smearing  in  the  downrange  direction  rather  than  in  the 
transverse  direction  (as  is  the  traditional  Lamb  shift).  The  downrange  Lamb  shift  occurs  when  there  is  a 
sound  speed  cusp  along  a  stochastic  rough  surface  and,  as  we  will  soon  see,  apparently  a  weak  form  of 
this  effect  also  occurs  when  there  is  a  density  jump  along  such  an  interface. 

4.4.2  Terms  Proportional  to  Surface  Curvature  and  Tilt 

Figure  15  also  takes  note  of  the  fact  that  in  Eq.  (81),  there  is  an  exotic  new  curvature-induced  diffuse 
scattering  term:  —  (/Z^/4)  .  This  interesting  scattering  effect  couples  modes,  but  it  will  not  be 

considered  very  deeply  in  the  present  context.  However,  assuming  the  incoherent  field  is  proportional  to 
the  surface  height  (i.e.,  8%  / ),  the  term  involves  the  important  expansion  parameter 

{j  ■  f^j  —  ~{f2') ,  which  is  worth  a  closer  look. 


Recall  that  tilt-induced  buffering  associated  with  a  sound  speed  jump  is  proportional  to  j  (e.g.,  see 
Eq.  (72)),  and  similarly,  if  the  surface  corresponds  to  a  jump  in  the  first  derivative  of  the  sound  speed, 
there  is  also  a  virtual  cloud  that  generates  a  term  proportional  to  j  (see  Eq.  (31)).  The  Foldy- 
Wouthuysen  procedure  will  eventually  add  higher-order  downrange  derivatives  that  create  terms 
proportional  to  higher  powers  of  /  .  This  makes  /  a  true  expansion  parameter,  limiting  the  slope  /  <  1 

and  so  the  interface  grade  to  under  45  deg.  This  a  trait  shared  with  rough  surface  scattering  theories  based 
on  the  Kirchhoff  approximation  or  perturbation  theory.  For  the  sound  speed  cusp  and  apparently  also  for 
a  density  jump,  there  is  a  “downrange  Lamb  shift”  effect  that  is  proportional  to  the  mean  square  slope 

(f2/-  As  we  have  just  seen,  this  quantity  also  shows  up  in  the  stochastic  diffuse  scattering  problem. 
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where  we  would  obtain  a  term  proportional  to 


(/  '  [incoherent  field])  «=(/•/)  =  -(/2)  • 


The 


expansion  parameter  /  f 2 )  is  a  broad-spectrum  parameter  in  the  sense  that  it  is  sensitive  to  all  scales.  For 
the  typical  scenario  where  surface  roughness  is  characterized  by  a  power-law  tail  such  that  the  quantity 
) / 2 )  blows  up  (e.g.,  the  spectrum  S  (k)  ~  \lkp  ;  p  <  4  for  a  two-dimensional  rough  surface),  we  have 

to  impose  a  cutoff.  For  the  PE,  we  follow  the  convention  from  quantum  mechanics  (e.g.,  Ref.  57,  p.  60) 
and  cut  off  the  wave  number  at  the  reference  wave  number  k0 .  This  cutoff  is  associated  with  a  limit  of 

spatial  resolution.  From  the  uncertainty  principle,  an  allowed  wave  number  range  of  0  to  kQ  corresponds 

to  a  maximum  spatial  resolution  of  1/2 kQ  =Ai)/A K  ,  where  is  the  reference  wavelength.  The  sensitivity 

to  an  upper  cutoff  of  the  downrange  Lamb  shift  and  the  curvature-induced  diffuse  scattering  is  associated 
with  a  sensitivity  of  the  underlying  physical  effect  to  all  scales  down  to  this  length  scale. 


Several  more  remarks  are  appropriate  concerning  the  need  to  provide  a  cutoff  beyond  which  the  field 
cannot  resolve  features  of  the  surface.  There  are  two  related,  but  distinct  scenarios  at  issue  here: 


1.  The  first  is  when  ^/“)  and  perhaps  also  )/2)  blow  up.  Specifically  for  a  two-dimensional 

surface,  if  the  tail  of  the  spectrum  goes  as  a  power  p  <  2  ,  i^f1 )  blows  up  and  the  cumulative 
effect  of  small-scale  features  causes  the  surface  height  to  be  unbounded,  and  if  the  tail  of  the 
spectrum  goes  as  a  power  p  <  4  ,  the  mean  square  slope  //2)  does  not  exist  and  the  surface  is  a 
fractal.  Clearly,  some  cutoff  is  necessary  if  our  theory  is  to  avoid  unphysical  infinities11) 

2.  The  second  is  when  )/2)  and  (f~j  are  both  finite,  but  scales  very  much  smaller  than  the 

wavelength  nevertheless  make  a  major  contribution  to  these  values.  This  can  occur  if  the  power 
law  is  integrable  for  k  — »  °°  ,  or  if  some  external  physical  constraint  forces  the  spectrum  to  cutoff 

at  some  large  value  of  k  .  For  example,  when  p  >  2  ,  ^  f 2 )  is  finite  and  when  p  >  4,  then 

^/2)  is  finite,  but  there  may  still  be  a  significant  contribution  from  scales  that  are  very  small 

compared  to  the  wavelength  of  the  field.  Alternately,  note  that  for  all  naturally  occurring 
classical  phenomena,  the  spectrum  must  cut  off  at  some  scale  (at  latest  at  the  atomic  scale).  Even 

if  that  occurs  at  far  below  the  Bragg  scale,  ^  f 2 )  and  ^/2)  must  still  exist  even  if  p  <  4  and/or 

p  <  2  ,  and  so  the  surface  is  never  really  of  infinite  height,  nor  does  it  continue  to  behave  like  a 

fractal  all  the  way  down  to  the  smallest  scales.  When  ^ f 2 )  and  //2\  exist,  but  depend  on  the 

characteristics  of  the  surface  at  very  small  scales,  then  the  question  becomes:  do  the  scales  well 
below  a  wavelength  really  matter  as  far  as  the  field  is  concerned?  The  answer  is  no.  Since  the 
field  cannot  possibly  react  to  what  happens  at  scales  far  below  that  allowed  by  the  uncertainty 
principle,  it  cannot,  for  example,  directly  depend  on  the  nature  of  some  physical  cutoff  in  the  far 
UV  (i.e.,  large  k  ).  (It  can  indirectly  depend  on  these  scales  via  the  values  of  the  environmental 
parameters,  but  that  is  another  issue.)  The  theory  as  currently  constituted  is  thus  overly  sensitive 
to  small  scales. 


n  On  a  semantic  note,  once  we  introduce  the  cutoff  on  the  fractal  surface,  it  is  no  longer,  strictly  speaking,  a  true 
fractal.  It  is  now  known  as  a  self-affine  surface.  By  this  we  mean  that  the  surface  behaves  like  a  fractal  within  the 
length  scales  that  matter  to  the  physical  process  that  concerns  us.  Any  naturally  occurring  “fractal”  is  really  a  self- 
affine  surface. 
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Both  of  these  scenarios  imply  that  some  imposed  cutoff  at  the  limits  set  by  the  uncertainty  principle  is 
absolutely  necessary.  In  the  first  case,  that  fact  is  impossible  to  ignore.  This  happens,  say,  in 
unrenormalized  Quantum  Electrodynamics  (QED).  In  the  second  case,  the  issue  is  really  the  same  (an 
imposed  cutoff  is  necessary  because  small  scales  are  oversampled),  but  now  it  is  easier  to  ignore,  and  in 
fact  it  is  often  overlooked.  This  is  an  illustration  of  an  important  issue  Weinberg  addresses  in  his  field 
theory  text  [76,  p.  441]  “...  the  renormalization  of  masses  and  fields  has  nothing  directly  to  do  with  the 
presence  of  infinities,  and  would  be  necessary  even  in  a  theory  in  which  all  momentum  space  integrals 
were  convergent.”  In  other  words,  tractable  approximations  to  field  theory  tend  to  be  overly  sensitive  to 
small  scales,  and  they  need  to  be  cured  of  it. 

4.4.3  Understanding  the  Cutoff  as  a  Crude  Form  of  Renormalization 

Our  artificially  imposed  cutoff  is  a  crude  form  of  renormalization  (valid  in  the  PE  /Schrodinger 
equation  (PE/SE)  limit).  The  Lamb  shift  was  historically  the  phenomenon  that  first  raised  the  issue  of 
renormalization  [76,  pp.  31-38].  Interestingly,  as  noted  by  Weinberg  (p.  38),  “A  fully  relativistic 
calculation  of  the  Lamb  shift  including  positrons  in  intermediate  states  could  have  been  attempted  in  the 
1930’s,  using  the  old  non-relativistic  perturbation  theory”  [76].  In  this  report,  we  present  a  variant  on  the 
idea  that  classical  PE/SE  theory  can  be  used  to  obtain  Lamb  shift  phenomena.  It  is  based  on  Welton’s 
approach  as  described  in  Refs.  57  and  62,  but  additional  information  is  also  extracted  from  the  new  terms 
introduced  by  the  FW  transformation. 

Note  that  in  the  “toy  model”  of  the  atomic  Lamb  shift,  it  is  already  that  diverges,  and  not  only 

=  ( v 2 )  .  This  is  analogous  to  the  case  where  both  ^/'2^  and  (j1^  diverge  for  a  rough  one-  or 

two-dimensional  surface.  In  other  words,  the  roughness  of  the  vacuum  fluctuationsss  corresponds,  for 
example,  to  the  p  —  2  case  for  a  rough  two-dimensional  surface.  Renormalization  is  thus  necessary  to 
handle  divergences  in  both  components  of  the  Lamb  shift  that  appear  in  our  “toy  model.” 

It  is  also  instructive  to  approach  the  issue  of  renormalization  from  another  point  of  view.  In  the  “toy 
model”  of  the  atomic  Lamb  shift,  the  new  term  imposed  by  the  FW  transformation  is  associated  with  a 
virtual  electron-positron  pair  (note  the  quote  in  the  paragraph  above).  This  becomes  a  loop  diagram  in  the 
language  of  Feynman  diagrams,  and  it  is  loop  diagrams  that  force  the  issue  of  renormalization.  The 
electron-positron  loop  is  associated  with  a  phenomenon  known  as  vacuum  polarization",  and  it  is  the  root 

cause  behind  the  divergence  of  —  (  v 2 )  .  (The  loop  diagram  associated  with  vacuum  polarization  is 

shown,  for  example,  in  Fig.  8-4,  p.  152  of  Ref.  57.)  Similarly,  the  fluctuations  introduced  into  the 
Schrodinger  equation  by  hand  correspond  to  virtual  photon-electron  loops.  These  virtual  photon-electron 
loop  diagrams  also  diverge  unless  some  kind  of  renormalization  procedure  is  invoked,  but  more  slowly 
than  do  the  loops  generated  by  electron-positron  pairs.  The  photon-electron  loops  are  connected  to  the 
vertex  correction  term  and  to  some  extent  also  to  the  electron  self-energy,  and  these  loop  diagrams  (also 

shown  in  Fig.  8-4,  p.  152  of  Ref.  57)  are  the  root  cause  behind  the  divergence  of  /^r2\ .  The  photon- 

electron  loops™  (connected  to  the  transverse  Lamb  shift)  form  the  bulk  of  the  atomic  Lamb  shift  effect. 


ss  Recall  that  in  this  context,  the  fluctuations  are  imposed  artificially  by  hand  since  our  model  imposes  second 
quantization  as  an  ad  hoc  modification  to  first  quantization. 

"  Vacuum  polarization  also  involves  a  virtual  photon,  since  it  is  a  consequence  of  the  interaction  of  the  electron  with 
vacuum  fluctuations,  which  in  turn  involve  virtual  photons. 
uu  Recall  that  these  are  associated  with  the  transverse  Lamb  shift. 
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The  vacuum  polarization"  is  much  smaller,  and  it  has  the  opposite  sign.  This  leaves  a  small  shortfall  in 
the  effect,  which  is  mostly  made  up  by  the  contribution  from  an  anomalous  magnetic  moment  [77].  There 
are  also  small  higher-order  corrections. 

4.4.4  Towards  Incorporating  the  New  Physics  into  Underwater  Propagation 

Having  developed  a  general  understanding  of  the  context  of  the  classical  Lamb  shift  relative  to  other 
field  effects,  the  next  step  in  examining  this  effect  is  to  explore  the  significance  of  the  classical  Lamb 
shift  in  the  context  of  underwater  propagation.  The  rough  sound  speed  cusp  Lamb  shift  may  appear  when 
the  acoustic  field  is  confined  in  a  duct,  and  propagates  for  long  ranges  (see  Fig.  16).  The  ducting  occurs 
because  the  sound  speed  profile  is  downward  refracting  on  the  top  portion  of  a  layer  (or  alternately  the 
air-sea  interface  bounds  the  field  from  above),  and  upward  refracting  below,  and  thus  it  is  not  directly 
caused  by  the  cusp.  However,  this  ducting  causes  a  significant  amount  of  acoustic  energy  to  be 
concentrated  in  the  vicinity  of  the  cusp,  and  this  provides  the  cusp  with  an  opportunity  to  influence  its 
behavior.  This  may  cause  a  significant  phase  shift  of  the  acoustic  wave.  Futhermore,  the  rich  impedance 
boundary  conditions  induced  by  the  classical  Lamb  shift  for  a  cusp  permit  the  existence  of  interface 
waves  trapped  near  the  stochastic  rough  interface.  These  boundary  waves  form  a  part  of  the  coherent 
field  that  decays  exponentially  away  from  the  interface.  In  other  words,  some  fraction  of  the  acoustic 
energy  is  confined  to  the  immediate  vicinity  of  the  interface.  This  is  a  kind  of  localization.  For  a  one¬ 
dimensional  interface  embedded  in  two-dimensional  x  —  z  space,  this  is  a  one -dimensional  localization 
effect,  while  for  the  atomic  Lamb  shift,  we  would  have  localization  in  three-dimensional  space.  This 
explains  why  the  Lamb  shift  is  sometimes  tied  to  Anderson  localization  (for  example,  see  Ref.  78, 
especially  the  top  of  column  ii,  p.  39). 


Fig.  16  —  The  acoustic  Lamb  shift  associated  with  a  rough  cusp  may  appear  in 
long-range  propagation  associated  with  a  duct  formed  by  pronounced  near¬ 
surface  layering. 


vv  Recall  that  this  is  associated  with  the  downrange  Lamb  shift,  and  so  with  the  new  terms  introduced  by  the  FW 
transformation.  Note  that  this  implies  that  structure  of  the  field  equations  automatically  forces  vacuum  polarization 
once  photon  loops  such  as  the  vertex  term  are  introduced. 
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The  acoustic  Lamb  shift  associated  with  a  sound  speed  jump  may  affect  shallow  water  propagation  via 
the  interaction  of  the  field  with  a  rough  ocean  bottom.  Here  the  effect  shifts  the  downrange  wave  number 
of  the  coherent  field  in  a  way  that  may  accumulate  within  the  phase.  However,  the  sound  speed  jump  is 
typically  quite  modest  for  ocean  bottoms  that  acoustically  behave  like  a  fluid  such  as  mud,  sand,  and 
relatively  unconsolidated  rock  (as  opposed  to  elastic  solids  such  as  limestone  and  basalt),  while  the 
density  jump  can  approach  a  factor  of  two.  Thus,  to  properly  model  a  rough  ocean  bottom,  it  will  be 
necessary  to  consider  a  density  jump  as  well  as  a  sound  speed  jump.  In  the  next  section,  we  will  see  that  a 
straightforward  generalization  of  the  FW  ansatz  permits  us  to  include  a  density  jump. 

5.  INTRODUCING  A  DENSITY  JUMP 

The  density  jump  as  found,  for  example,  at  the  ocean  bottom,  is  introduced  here  in  Section  5.  In 
Section  5.1,  the  basic  components  of  the  FW  procedure  are  adapted  to  the  case  where  the  density  jumps, 
and  then  Section  5.2  discusses  8  -function  bifurcation,  an  important  new  tool  needed  for  adapting  the 
results  to  interfaces  where  the  density  jumps.  In  Section  5.3,  the  interface  where  the  density  jumps  is 
examined  for  the  first  time.  Tilt-induced  smearing  is  examined  in  the  context  of  the  deterministic 
problem,  and  then  the  resultant  insights  are  used  to  obtain  the  classical  Lamb  shift  associated  with  a 
density  and  sound  speed  jump.  Here,  as  with  the  sound  speed  cusp  but  unlike  the  sound  speed  jump,  we 
obtain  a  small  tilt/curvature -induced  “vacuum  polarization”  correction  to  the  classical  Lamb  shift.  The 
examination  of  the  classical  Lamb  shift  concludes  with  a  discussion  of  its  possible  relevance  to 
underwater  acoustics. 

In  addition  to  their  direct  effect  on  the  scattering  problem,  the  phenomena  examined  in  Sections  4  and 
5  also  teach  us  something  very  important  about  the  PE.  They  use  the  dominant  component  of  the  Lamb 
shift,  transverse  smearing  induced  by  stochastic  averaging,  to  develop  a  solid  understanding  of  stochastic 
smearing  (i.e.,  buffering)  of  a  singularity  associated  with  the  environmental  parameters.  Juxtaposing  this 
phenomenon  with  tilt-induced  smearing  in  the  deterministic  problem,  we  discover  that  the  PE  buffers 
(i.e.,  smears  out)  singularities  in  the  deterministic  problem  in  a  way  that  closely  mimics  the  buffering 
imposed  by  the  stochastic  problem. 

Section  5  also  serves  to  build  our  understanding  of  some  of  the  issues  associated  with  the  density 
jump.  In  Section  6,  the  knowledge  acquired  in  Sections  4  and  5  will  be  used  to  examine  a  very  important, 
but  also  very  subtle  issue  that  must  be  addressed  if  density  jumps  are  to  be  properly  incorporated  into  the 
PE. 


5.1  The  Basic  Formalism 


When  the  density  varies,  the  problem  generalizes  in  a  rather  straightforward  manner. 


below,  the  changes  associated  with  density  variation  are  enclosed  in  boxes) 


In  the  formulas 


The  formal  development  in  this  section  applies  equally  to  two-dimensional  and  three-dimensional 
spaces.  Interfaces  are  explicitly  introduced  in  Section  5.2.  Once  interfaces  are  considered,  consideration 
will  once  again  be  restricted  by  convention  to  two-dimensional  spaces. 


5.1.1  The  Ansatz 


Recalling  Eq.  (1),  the  Helmholtz  equation  generalizes  to 
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where  p  is  the  density.  Now,  the  Foldy-Wouthuysen  ansatz  becomes 
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where  p0  is  some  reference  density.  The  time-averaged  downrange  energy  flux 
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is  proportional  to  a  factor  of  1/ p  and  this  is  built  into  6  and  % ,  so  that 

M2-kf=-2  P0c0Sx, 


(85) 


and  so  9  and  %  are  indeed  carriers  of  flux  as  before  (the  reference  density  pQ  and  the  reference  sound 
speed  c0  =  6)/k0  are  just  constants).  In  Appendix  I,  it  is  shown  that  the  state  space  equation  is  once  again 

i—  =  H®  with  n  =  O  +  £  +  k0rj, 
dx 


where  now 
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with  the  matrices  //  and  E,  as  before,  and 
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Note  that  now  U  is  defined  in  terms  of  the  compressibility  K  =  1 j(c2 p^  rather  than  the  sound  speed 
squared,  but  the  definition  for  p  reduces  to  the  previous  expression  if  the  density  is  everywhere  the 
same.  K0  is  some  reference  compressibility.  Note  also  that  A  (the  coefficient  of  the  odd  matrix  £ )  is 
still  Hermitian,  but  it  acquires  an  extra  term  related  to  the  departure  of  the  local  density  p  from  its 
reference  value  p0 .  The  state  space  equation  is  very  similar  to  what  it  was  when  the  density  was 
everywhere  the  same,  but  now  the  even  operator  £  has  also  acquired  an  extra  factor  proportional  to  y 
that  measures  how  much  the  local  density  differs  from  its  reference  value. 
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5.1.2  The  Diagonalized  Hamiltonian 

To  obtain  the  diagonalized  matrix  Hamiltonian  TL  to  fourth  order,  consider  the  general  results  shown 
in  Eqs.  (14)  and  (15),  which  are  reproduced  here  for  convenience: 


where 
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and  insert  the  definitions  for  £  and  O  from  Eq.  (86).  This  gives  us  TL  to  fourth  order  (counting  powers 
of  A,  y  and  d/dx  (acting  on  A, y)  as  orders): 
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The  details  of  the  derivation  of  Eq.  (90)  are  given  in  Appendix  J.l  on  the  CD.  Note  that  we  wave -dropped 
the  tilde  and  the  superscript  TV  .  (Recall  that  the  commutator  [,]  and  anti-commutator  {,}  are  defined  in 
footnote  f.) 

The  next  step  is  to  modify  Eq.  (90)  for  the  special  case  where  the  density  variation  is  a  jump.  This  is 
particularly  useful,  since  the  density  is  often  fairly  constant  within  a  given  material,  but  jumps  as  one 
passes  from  one  material  to  another.  As  long  as  the  transition  region  from  material  to  material  is  very 
small  on  the  order  of  a  wavelength  (and  it  often  is,  particularly  in  the  context  of  low  frequency  acoustic 
scattering  from  the  ocean  bottom),  it  is  a  good  approximation  to  model  the  jump  in  the  density  function 
by  a  step  function  along  a  precisely  defined  interface  dividing  the  two  materials.  However,  before 
proceeding  to  develop  a  form  of  the  Hamiltonian  TL  given  by  Eq.  (90)  fine-tuned  for  such  density  jumps, 
we  have  to  explore  a  very  important  formal  result:  8  -function  bifurcation. 

5.2  8  -function  Bifurcation 


Once  steps  are  introduced  into  the  environmental  parameters,  the  field  theory  will  invariably  generate 
scenarios  where  distributions  are  multiplied.  From  a  mathematical  standpoint,  there  is  no  completely 
general  way  to  define  such  quantities.  However,  it  is  possible  to  develop  empirical  rules  for  interpreting 
these  quantities  in  the  context  of  field  theory.  This  reflects  the  fact  that  the  mathematics  in  this  context 
describes  underlying  physical  effects  that  are  well  behaved. 
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5.2. 1  The  Bifurcation  Rules 

To  deduce  the  rules  for  the  multiplication  of  distributions,  consider  two  prototypical  problems,  which 
cover  the  key  physical  circumstances  of  interest  in  the  current  discussion,  and  where  we  know  an 
acceptable  interpretation  for  a  product  of  derivatives  must  exist.  Specifically  demand  that: 

■  It  must  be  possible  to  use  the  chain  rule  in  the  full  wave  equation  to  recover  the  boundary 
conditions  for  an  interface  where  the  density  jumps: 

V •  ( /T’VA)  +  k°n  A  =  0^>  continuity  of  A,— — 

v  ’  p  p  dn 

■  In  the  constant  density  PE  {8p  =  0 ),  where  the  sound  speed  jumps  along  a  flat  (range- 
independent)  surface,  adding  orders  to  the  PE  recovers  the  explicit  boundary  conditions  for  the 
higher-order  derivatives  of  the  field  (i.e.,  the  boundary  conditions  on  Vfff  ,  n  >  2  )ww. 

These  conditions  force  8  -function  bifurcation.  This  means  that  the  first  time  a  step  function  is 
differentiated  to  form  a  8  -function,  that  8  -function  bifurcates  into  two  half-  8  -functions  an 
infinitesimal  distance  on  either  side  of  the  interface: 

90(^-/)  w  8+(z-f)  8~{z-f) 

Now,  other  factors  in  the  product  are  no  longer  distributions,  since  all  are  evaluated  just  inside  the  half¬ 
space  on  either  side  of  the  interface. 

5.2.2  8  -function  Bifurcation  in  Other  Contexts 

This  result  has  in  the  past  appeared  in  another  context.  Green’s  function  integral  formulas  for  the 
solution  to  the  Helmholtz  equation  in  a  bounded  region  pick  up  factors  of  Vi,  and  this  ultimately  comes 
from  a  similar  8  -function  bifurcation  at  the  normal  derivative  of  the  Green’s  function  on  the  boundary 
[79-81].  (The  first  two  references  do  not  explicitly  use  the  terminology  of  distribution  theory,  but  the 
result  is  the  same.) 

The  issues  related  to  the  multiplication  of  distributions  are  relatively  hidden  in  quantum  mechanics, 
and  to  some  extent  also  for  the  sound  speed  cusp.  Since  the  wavefunction  itself  is  continuous,  the 
products  that  involve  distributions  occur  in  third  and  higher-order  terms  in  the  Hamiltonian  H  - 
examples  include  the  terms  proportional  to 

(v2y)v2,vy(v2y)v,y(v2y)v2,(v2y)2.  (9i) 

(On  the  other  hand,  products  of  distributions  are  already  in  Section  4.1,  where  we  consider  quasi-first 
order  theory  for  an  interface  where  the  sound  speed  jumps.)  Obscuring  the  whole  issue  even  further  is  the 


ww  In  considering  this  range-independent  problem,  we  will  exploit  the  fact  that  the  eigenvector  structure  of  the 
solution  will  force  the  orders  in  the  operator  A  to  decouple.  Basically,  this  means  that  any  distribution  generated  in 
A" X  must  cancel  internally,  and  so  adding  a  higher  order  to  the  PE  will  add  new  boundary  conditions,  but  not 
change  preexisting  boundary  conditions  extracted  from  lower  order  theory.  The  details  of  this  assertion  are 
discussed  at  the  beginning  of  Appendix  K.2.1.2. 
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fact  that  in  quantum  mechanics,  one  typically  calculates  expectation  values,  often  using  decompositions 
of  unity  to  simplify  the  evaluation  of  these  expressions.  Specifically,  physical  quantities  are  characterized 
by  expectation  values  of  (Hermitian)  operators,  and  products  within  these  expectation  values  can  be 
broken  apart  by  interjecting  decompositions  of  unity  of  the  form  (p^jlfp  |  (where  the  set  of 

n 

eigenvectors  \(pn  )  (for  the  time-independent  problem)  form  a  basis  for  the  solution  set).  Then,  products 

of  distributions  such  as  in  Eq.  (91)  seem  to  go  away.  Decompositions  of  unity  are  also  useful  because 
they  reduce  the  number  of  distinct  expectation  values  that  need  to  be  calculated,  and  the  technique  is 
particularly  appealing  when  used  in  the  context  of  the  atomic  problem,  because  it  helps  to  skirt  the  issue 
that  V  itself  is  already  infinite  at  the  origin™. 

5.2.3  Caveats 

It  is  important  to  keep  in  mind  that  the  multiplication  of  distributions  is  not  defined  in  general.  Thus, 
the  8  -function  bifurcation  prescription  only  applies  in  certain  specified  contexts,  and  it  must  be  applied 
with  care.  The  prescription  works  well  when  the  associated  step  functions  are  raised  to  simple  powers 

(e.g.,  a  term  proportional  to  0"  (z  —  f  (.r)) 8(z  —  f  (a'))  ),  and  when  it  is  used  in  conjunction  with  the 

ordinary  rules  of  differentiation  such  as  the  chain  rule  and  the  product  rule.  An  example  of  the  proper  use 
of  the  8  -function  bifurcation  prescription  in  conjunction  with  the  chain  rule  would  be  if  we  are  given 

some  function  of  the  step  function  —  /  (*))),  and  then  define  its  derivative 

dg(0{z-f))/dz  =  g'(0(z-f))-dQ(z-f)/dz=[dg/dy]y=e(:f)8(z-f(x)) 


(and  similarly  for  dg /dx ).  Similarly,  the  product  rule  would  imply  that 


^[g(&{z-f(x)))-h(Q(z-f(x)))_  = 
g(M(^/(  ))).^(Q(z-/(x)))  +  g(Q(z_/(x))). 


dh(yQ(z-f(x))) 

dz 


However,  naively  multiplying  through  by  functions  of  the  step  function  will  cause  problems.  For 
example,  multiplications  such  as  \J 1  +  constant  ■  0  ■  8  or  [1/(1+  constant  ■  0)J  ■  8  will  produce 


incorrect  results  when  combined  with  the  bifurcation  prescription  (at  least  if  both  sides  are  weighted 
equally).  These  functions  apparently  distort  the  step  and  change  the  weighting  factor.  On  the  other  hand, 
we  can  expand  these  functions  in  a  Taylor  series  expansion,  which  consists  of  a  sum  of  terms  that  are 
products  of  the  form  0"  ■  8 .  Now,  for  these  clean  undistorted  steps  we  can  use  the  8  -function 
bifurcation  prescription. 


xx  However,  keep  in  mind  that  for  the  time-independent  case,  the  higher-order  Hamiltonian  H  is  strictly  made  up  of 
powers  of  A  =  — ZrV2/2 m0  +  V  ,  and  as  noted  in  the  previous  footnote,  A  is,  crudely  speaking,  interchangeable  with 
an  eigenvalue.  Thus,  A"<p  °c  cp  and  so  the  infinities  associated  with  powers  of  V.  with  the  distributions  of  the  type 
shown  in  Eq.  (91),  and  with  derivatives  of  the  wavefunction  <p ,  must  cancel  when  all  are  taken  together.  (Also  note 
that  there  must  be  internal  cancellations  between  infinities  generated  by  the  operators  V2  and  V  associated  with 
each  specific  application  of  the  operator  A  .) 
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The  observation  that  8  -function  bifurcation  works  for  undistorted  steps  is  an  intuitively  sensible 
result.  On  the  other  hand,  it  is  not  a  priori  clear  that  there  would  not  be  a  problem  with  distorted  steps 
generated  internal  to  the  chain  rule.  In  other  words  it  is  somewhat  surprising  that  in  this  context  the  step 

in  g'(0)  is  apparently  undistorted  with  respect  to  the  step  0 .  For  present  purposes,  it  is  sufficient  to 

note  that  the  result  seems  to  hold  in  the  two  rather  different  examples  we  consider  in  the  Appendix  Kyy  as 
well  as  in  the  Green’s  function  integral  formula  in  scattering  theory,  and  that  these  examples  are  typical  of 
scenarios  that  arise  in  the  context  of  the  PE.  Development  of  a  full  mathematical  theory  of  the  general 
result  (including  possible  restrictions  on  its  validity)  will  be  deferred  to  future  research22.  Similar  issues 
apply  to  the  product  rule.  For  our  purposes,  we  note  that  the  product  rule  is  used  in  one  form  or  another  in 
all  four  examples  discussed  in  the  Appendix  K  as  well  as  in  the  Green’s  function  integral  formulas  from 

scattering  theory,  and  that  it  consistently  produces  correct  results.  Thus,  in  this  context  the  steps  in  g  (0) 
and  /r(0)  are  apparently  undistorted  and  the  product  rule  can  be  used  in  conjunction  with  8  -function 

bifurcation.  (There  is  some  further  discussion  concerning  the  need  for  undistorted  steps  in  Appendix 
K.2.2.) 

5.2.4  Plausibility  Arguments 

Appendix  K  provides  a  plausibility  argument  for  the  8  -function  bifurcation  rules  given  above. 
Section  K.  1  considers  two  cases  where  the  rules  apply: 

■  Subsection  K.  1 . 1  examines  a  flat  interface  where  the  sound  speed  jumps,  but  not  the  density, 

■  Subsection  K.  1.2  considers  the  full  two-fluid  interface  in  the  full  wave  (Helmholtz  equation) 
problem. 

Then,  Section  K.2  examines  two  cases  where  the  prescription  only  seems  to  work  to  first  order: 

■  The  substitution  a  =  \j  p  is  analyzed  in  Subsection  K.2. 1.1  and 

■  Tappert’s  change  of  variable  substitution  u  =  A/ y fp  is  evaluated  in  Subsection  K.2. 1.2. 

In  Section  K.2.2,  we  examine  why  some  cases  only  seem  to  work  to  first  order.  In  particular,  the  need  for 
undistorted  steps  is  discussed  in  this  section,  and  then  our  observations  are  verified  in  Subsection  K.2. 3, 
when  the  CC  =  \j  p  case  is  extended  to  second  order  using  a  Taylor  series  expansion.  Finally,  the 
associative  property  for  our  prescription  is  briefly  examined  in  Section  K.3. 

Before  closing  this  subject,  it  should  be  emphasized  that  it  is  implicitly  assumed  in  the  8  -function- 
bifurcation  rules  that  the  parameters  jump  at  an  interface,  but  do  not  otherwise  vary  in  the  vicinity  of  the 
interface.  As  we  see  in  Appendix  J.2  (the  argument  leading  to  Eq.  (J.  19)),  extra  care  must  be  exercised  if 
we  apply  8  -function  bifurcation  in  situations  where  this  condition  is  violated.  In  particular,  the 
associative  property  (see  Appendix  K.3)  in  choosing  which  distributions  to  bifurcate  may  not  hold,  and 
explicit  symmetrization  may  be  necessary. 


yy  The  chain  rule  for  differentiation  and  the  product  rule  for  differentiation  are  both  used  in  the  full  wave  two-fluid 
problem  considered  in  Appendix  K.  1.2,  and  following  a  more  complicated  scenario,  they  are  also  ultimately  needed 
for  the  successful  implementation  of  the  change  of  variable  (COV)  substitution  discussed  in  Appendix  K.2. 1.2. 

22  The  result  appears  to  be  related  to  the  fact  that  in  =  \dg /dy]  ^ ,  the  discontinuous  function  0(z)  is 

not  really  embedded  inside  the  function  dg/dy  .  Instead,  it  is  applied  after  the  fact  to  produce  a  “clean”  (i.e., 
undistorted)  step.  In  fact,  given  the  discontinuities  in  both  g  and  0 ,  there  does  not  appear  to  be  any  way  to  make 
sense  of  g'(Q)  other  than  the  interpretation  g'(y)  with  y  first  treated  as  a  general,  continuous  function,  and  only 
subsequently  set  equal  to  a  discontinuous  function. 


Using  the  F olds -Wouthuy  sen  Transformation 


83 


5.3  The  Interface  Where  the  Density  Jumps 

In  Section  5.3.1,  the  Hamiltonian  (Eq.  (90))  is  converted  to  a  form  that  is  useful  for  the  special  case 
where  the  density  jumps  along  an  interface,  but  is  otherwise  constant  in  the  half-spaces.  This  section  also 
examines  the  transformation  between  the  pressure  field  A  and  the  auxiliary  field  X  associated  with  the 
PE.  Section  5.3.2  generalizes  to  the  density  jump  the  discussion  of  Section  4.1  by  examining  the 
boundary  conditions  in  quasi-first-order  theory.  The  previous  discussion  is  extended  as  the  new  FW 
terms  are  examined  closely  in  Section  5.3.3.  Section  5.3.4  examines  the  classical  Lamb  shift  for  a  rough 
interface  where  the  density  and  compressibility  jump  and  so  it  is  a  generalization  of  Section  4.3.  Finally, 
Section  5.4  examines  the  significance  of  the  classical  Lamb  shift  in  a  typical  scenario  for  acoustic 
propagation  in  a  shallow  water  (coastal)  environment. 

5.3.1  The  Basic  Formal  Structure 

5.3.1.1  The  Form  of  the  Hamiltonian  Useful  When  the  Density  Jumps 

We  are  now  in  a  position  to  use  the  8  -function-bifurcation  prescription  established  in  Section  5.2  to 
tailor  the  general  fourth-order  Hamiltonian  for  an  acoustic  field  in  an  environment  where  the  density 
varies  (Eq.  (90)  of  Section  5.1)  to  the  special  case  where  the  density  jumps  at  interfaces,  but  is  otherwise 
constant: 


H  =  rik0 


,  „  A  A2  A3 

1  —  2y  h - — 

K  2k-  2  kl 


5A4  ,1  „  x  l 
8  kt  v  /JU3  16  k4 


3  {A, A] 


2k. 


2  K 


+  5  th  order. 


(92) 


This  result  is  derived  in  Appendix  J.2.  In  the  derivation,  it  is  assumed  that  a  quasi-planar  interface 
separates  two  half-spaces,  where  the  compressibility  K  (or  equivalently  p  =  /2(l  —  K/K0))  is  assumed 

to  be  range -independent  in  the  half-spaces  and  the  density  p  (or  equivalently  Y  =  /(l  —  p/p^))  is 

assumed  to  be  constant  in  the  half-spaces.  Both  quantities  may  jump  along  the  range -dependent  interface. 
The  assumption  that  there  are  only  two  half-spaces  is  made  to  keep  the  discussion  straightforward,  but 
there  is  nothing  in  the  derivation  itself  that  disallows  a  scenario  where  several  interfaces  are  present. 

As  discussed  in  Appendix  J.2  (and  to  a  lesser  extent  later  here  in  Section  5.3.1),  in  the  half-spaces 
where  the  density  is  constant,  the  terms  involving  the  anticommutator  [A,  y]  simplify  dramatically. 

Equation  (92)  maintains  the  more  complicated  form  because  these  terms  will  generate  8  -functions  at  the 
interface,  and  only  the  full  form  involving  these  anticommutators  will  produce  the  correct  distributions 
along  the  interface. 

During  the  derivation  in  Appendix  J.2,  we  also  monitor  the  possibility  that  the  compressibility  K  (or 
equivalently  LI )  may  be  range-dependent  in  the  half  spaces.  At  the  end  of  the  derivation,  we  will  find 
that  if  p  is  range-dependent  in  the  half-spaces,  we  pick  up  an  extra  (nominally  fourth-order)  term  in  TL : 
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The  subscript  S  indicates  that  As  is  to  be  evaluated  precisely  on  the  interface  S  .  The  superscript  ± 

serves  to  remind  us  that  as  As  bifurcates,  dll / r)x  is  evaluated  just  inside  the  half-spaces.  Furthermore, 
since  we  are  now  allowing  LI  and,  consequently,  A  to  depend  on  the  range  away  from  the  interface,  the 

usual  term  proportional  to  X2  must  also  be  added  to  the  Hamiltonian  'H  in  the  half-spaces  away  from  the 
interface.  Indicating  that  this  quantity  is  evaluated  in  its  standard  form  everywhere  except  right  on  top  of 

the  interface,  we  identify  the  standard  A2  -term  as  a  principal  value  (PV).  Note  that  if  LI  is  range- 
mdependent  in  the  half-spaces,  then  both  sides  of  Eq.  (93)  indeed  reduce  to  zero. 


Equations  (92)  and  (93)  define  a  diagonal  matrix  Hamiltonian  7 ~L  to  be  used  in  a  wave  equation  of  the 
form 


d4o' 

dxyZ; 


=  n 


' 9 ' 


(94) 


where  6  and  %  are  decoupled  scalar  fields  that  propagate  in  the  uprange  and  downrange  direction, 
respectively  (as  always,  we  will  study  the  behavior  of  % ).  (The  fields  6  and  %  differ  from  the  fields  of 
the  same  name  defined  in  Eq.  (85)  by  a  sequence  of  FW  transformations.  As  with  the  Hamiltonian  ‘H  , 
we  have  dropped  the  tildes  that  we  sometimes  associate  with  quantities  that  come  out  of  the  FW 
procedure.  Once  the  diagonalization  procedure  has  been  completed,  these  tildes  have  outlived  their 
usefulness.) 

Note  that  the  assumption  that  the  density  is  constant  within  a  given  medium,  and  only  changes  when 
one  passes  from  one  medium  to  another,  is  very  reasonable.  The  compressibility  is  much  more  likely  to 
vary  significantly  within  a  medium.  Indeed,  sound  speed  fluctuations  within  a  given  medium  are 
typically  generated  by  fluctuations  of  the  compressibility  rather  than  of  the  density. 

5.3.1.2  The  Hamiltonian  in  the  Half-spaces 

The  PE  formalism  not  only  picks  out  a  specific  downrange  direction,  but  also  reference  values  for  the 
density  and  compressibility.  These  are  typically  chosen  to  be  the  maximum  values  of  these  quantities. 
For  the  basic  interface  problem,  there  are  two  values  of  the  density  in  play,  and  y  will  be  nonzero  on  the 

side  where  the  density  is  smaller  (i.e.,  different  from  the  reference  value  p0  =  /?maximum)-  On  that  side  of 
the  interface,  the  Hamiltonian  will  involve  an  expansion  not  only  in  A  as  before,  but  also  in  y,  the 
measure  of  the  departure  of  the  local  value  of  the  density  from  its  reference  value.  (In  power  counting, 
both  A  and  y  contribute  equally.)  As  before,  the  expansion  must  eventually  converge  to  the  “exact” 
square  root  operator  (neglecting  any  local  range  dependence  in  the  half-spaces).  There  are  several  ways 
to  combine  terms  to  verify  that  this  is  indeed  the  case.  In  the  Appendix  L.l,  one  of  these  is  derived: 


(95) 


where  CC  =  p0j  p .  (Note  that  this  result  is  only  valid  in  half-spaces  where  p  is  locally  constant  and  the 
range  dependence  in  K  is  locally  weak.)  The  second  instance  of  OC  in  Eq.  (95)  (i.e.,  the  one  in  the 
product  2  a  A)  comes  from  the  series 
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i + (2  r)+ (2  rf  +  (2  yf + ( 2 r)4  ■■■= y = «. 


(96) 


and  the  factor  of  1/ a  in  front  simply  comes  from  1/  CC  =  1  —  2y .  Note  that  this  expansion  indeed  recovers 

the  correct  result  H  =  +  k2  .  The  result  (Eq.  (95))  emerges  from  a  resummation  of  terms  that 

misses  out  on  some  cancellations,  and  therefore  implies  a  more  stringent  convergence  condition  than  is 
really  the  case.  It  is,  however,  a  particularly  compact  form  of  the  full  expansion,  and  therefore  useful  as  a 
mnemonic  for  reproducing  the  new  PE  expansion  in  the  half-spaces.  CC  should  always  be  written  in  terms 
of  y ,  and  the  result  for  H  should  be  expanded  in  A  and  y  to  the  desired  order  (with  A  and  y  counting 
equally  in  determining  the  order). 

An  alternate  closed  form  for  °°  -order  H  that  is  based  on  a  different  grouping  of  the  terms  in  the  A  - 
y  expansion  is  derived  in  Appendix  Subsection  L.2.  This  form  more  accurately  reflects  the  convergence 
properties  of  the  expansion  -  in  fact,  the  infinite-order  result  looks  just  like  the  traditional  form  of  the 
Hamiltonian  in  the  water  column  (i.e.,  the  form  that  leads  to  an  expansion  in  the  sound  speed  deviation 
only,  Eq.  (4)).  This  result  suggests  that  we  will  be  able  to  proceed  as  before  and  use  traditional  PE  codes 
in  the  water  column,  even  though  formally  the  Hamiltonian  in  the  water  column  now  contains  an 
expansion  in  the  parameter  measuring  the  density  jump:  y.  However,  note  that  to  determine  the  actual 
order  of  such  a  result,  we  will  still  have  to  formally  expand  in  A  and  y  the  Hamiltonian  we  actually  use. 


5.3.1.3  The  Transformation  Between  the  Pressure  Field  and  the 
Auxiliary  Field  Associated  with  the  PE 

As  briefly  discussed  near  the  beginning  of  Appendix  C.2.1,  the  fact  that  the  FW  transformation  is  by 
construction  pseudo-unitary  guarantees  that  the  integral  over  transverse  space  of  the  quantity  |<?|  ~\%\ 

always  remains  proportional  to  the  total  energy  flux  in  the  uprange/downrange  direction.  This  is  true  even 
if  6  and  %  are  the  decoupled  fields  that  have  emerged  after  repeated  applications  of  the  FW 
transformation  (e.g.,  as  in  Eq.  (94)).  This  implies  that  %  will  always  be  some  kind  of  carrier  of 

downrange  flux.  Specifically,  | zf  should  differ  from  the  downrange  energy  flux  by  no  more  than  an 
integration  by  parts.  This  leads  us  to  surmise  that  the  carrier  of  flux  %  must  have  a  relationship  vis-a-vis 
the  pressure  field  A  that  is  similar  to  the  one  it  had  when  the  density  was  constant;  however,  now  an 
extra  factor  of  1 J sfp  must  be  thrown  in  because  the  square  of  this  quantity  is  in  the  energy  flux,  but  it  is 

no  longer  globally  constant.  The  result  (again  valid  in  half-spaces  where  the  density  is  locally  constant 
and  the  range  dependence  of  the  compressibility  is  locally  weak)  is 


>  Cl. 

-Jp\  dx 


A 


oc 


1+ 


2aA 


A  =  Z- 


vo  j 


(97) 


Examining  the  first  and  second  orders  explicitly  verifies  that  our  conjecture  is  correct  (see  Appendix  D.2). 
Again  note  that  Eq.  (97)  is  essentially  a  mnemonic  for  the  corresponding  finite  order  expansion  in  A  and 

y .  Also  note  that  \[J7  is  still  a  near-eigenoperator  and  so  nearly  cancels  in  weakly  range -dependent 
environments,  but  now  there  is  also  a  factor  \fcx  that  cannot  be  dropped  even  if  there  is  no  range 
dependence  at  all.  (For  this  argument  to  work,  we  also  need  to  recall  that  the  commutator  of  yfa  and 
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\[J7  is  nonzero  only  right  on  top  of  the  interface,  where  Eq.  (97)  is  not  valid  anyway,  and  where  we  do 
not  need  to  transform  from  the  auxiliary  field  %  back  to  the  pressure  field  A .)  Note  that,  strictly 
speaking,  the  result  for  the  endpoint  calculation  is  only  as  accurate  up  the  same  order  in  A  and  y  as  the 
Hamiltonian  used. 


5.3.2  Deterministic  Quasi-First-Order  Theory 

We  now  have  everything  we  need  to  examine  quasi-first-order  deterministic  theory  for  the  case  where 
the  density  jumps  along  an  interface.  This  study  extends  the  calculation  in  Section  4.1  (which  is  itself  an 
extension  to  a  sound  speed  jump  of  the  discussion  in  Section  3.2.3  for  the  sound  speed  cusp). 

The  current  situation  is  similar  to  that  described  toward  the  end  of  Section  5.3.1,  but  here  we 
specifically  restrict  ourselves  to  the  standard  two-dimensional  x  —  z  space.  Regions  1  and  11  are 

separated  by  a  rough  surface  z  =  /  (jc)  along  which  the  density  p  and  the  compressibility  K  jump. 
These  parameters  are  now  both  assumed  to  be  constant  in  Regions  I  and  II  (but  this  will  again  be  relaxed 
for  K  in  Section  5.4).  The  reference  values  for  these  parameters  are  in  general  p{)  and  K0 .  Since  the 

density  is  no  longer  globally  constant,  LI  generalizes  to  p  =  (l  —  yK^j2  —  (l  —  n2  •/%)/2  ,  where  as 

before  n  =  c0/c  =  ^J[pK]/\p0KQ\ .  Similarly,  )/2-  As  Just  noted  in  Section  5.3.1,  the 

reference  parameters  pQ  and  K0  are  typically  chosen  to  be  their  maximal  values.  In  the  conventions  used 

here,  pu  and  K,  are  the  maximal  quantities,  and  so  K0  =  Kt  and  p0  =  pn  .  This  corresponds  to  the 

prototypical  case  where  Region  I  is  the  water  column  and  Region  II  is  sediment  in  the  sub-bottom 
(generally  of  the  ocean).  These  conventions  are  summarized  in  Fig.  17. 

As  in  Section  3.1,  express  the  diagonalized  matrix  Hamiltonian  in  terms  of  a  scalar  Hamiltonian: 
TL  =  pH  .  The  quasi-first  order  scalar  Hamiltonian  H  based  on  Eq.  (92)  is  then 


H  =  k0+A-2k0y- 


(98) 


From  Eq.  (94),  we  see  that  this  Hamiltonian  is  associated  with  the  usual  scalar  PE  (Eq.  (18))  for 
downrange  propagation: 


—i  —  =  H  x  ■ 
dx 


The  5  -function-bifurcation  rules  outlined  in  Section  5.2  are  used  as  needed  to  evaluate  H.  The 
infinitesimal  transverse  integrations  described  in  Section  3.3.3  (and  already  used  in  the  simpler  version  of 
this  problem  described  in  Section  4.1)  are  applied  to  the  PE.  The  boundary  conditions  that  emerge  are 
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Field  that  carries 
downrange  flux:  %n 


Region  I: 
Typically  water 


Physical  parameters  in  region  I: 

р,  =  density  in  region  I 

с,  =  soundspeed  in  region  I 

K,  =  compressibility  in  region  I  =  — 
Pici 


x+ (downrange) 

r  =  df_ 

dx 


f  = 


d1  f 
dx 2 


Field  that  carries 
downrange  flux:  %, 


Region  II: 

Typically  the  bottom 


<SK  =  Ku  -  K,;K;  is  the  reference  value  K0 
8p  =  pn  —  Pj ;  pn  is  the  reference  value  p0 


Physical  parameters  in  region  II: 
p„  =  density  in  region  II 
cn  =  soundspeed  in  region  II 

K „  =  compressibility  in  region  I  =  — r- 
Pucn 


Use  for  K,  >  Kn ;  pn  >  p, 
Reference  wavenumber:  k0 


CO, 


—  co^K.0p0  —  eo^K !  Pn 


Reference  sound  speed  calculated  using 
reference  density  and  compressibility; 
note  that  the  reference  values  are  typically  chosen 
to  be  the  largest  values  of  the  parameter 


Fig.  17  —  We  consider  the  standard  two-dimensional  x  —  z  space.  Regions  I  and  II  are  separated  by  the 
rough  surface  z  =  f(x).  The  positive  z-axis  points  from  Region  II  into  Region  I.  The  density  p  and  (for 
the  moment)  also  the  compressibility  K  are  constant  in  the  half-spaces.  Region  I  is  typically  the  water 
column,  and  Region  II  is  typically  sediment  in  the  sub-bottom.  In  order  to  minimize  the  related  PE  expansion 
parameters  y  and  // ,  the  reference  values  should  be  chosen  to  be  the  maximum  values  of  the  parameters.  In 
this  case,  this  means  that  the  reference  density  is  chosen  to  be  that  in  Region  II  (the  bottom)  and  the  reference 
compressibility  is  chosen  to  be  that  in  Region  I  (the  water  column). 


A  bar  denotes  an  average  between  the  respective  quantities  evaluated  on  the  two  sides  of  the  interface. 
Details  of  the  derivation  of  boundary  conditions  (Eq.  (99))  are  given  in  Appendix  M.  Setting 
SK/  Kj  =  -2//„  and  dp  -  0  ,  we  verify  that  Eq.  (99)  is  indeed  a  straightforward  generalization  of  Eq. 
(71). 

We  defer  the  extension  of  the  discussion  in  Section  4.2.1  to  Section  6.1.5.  In  that  section,  we  again  use 
the  full-wave  boundary  conditions  to  gain  an  understanding  of  the  deterministic  boundary  conditions 

generated  by  distributions  in  A  —  this  time,  the  more  general  X  responsible  for  the  boundary  conditions 
given  by  Eq.  (99). 

Having  derived  the  boundary  conditions  for  quasi-first-order  theory,  let  us  gain  further  insight  into  the 
new  terms  uniquely  generated  by  the  FW  transformation  (i.e.,  those  with  an  explicit  range  derivative;  as 
we  have  already  argued,  such  terms  are  loosely  associated  with  “vacuum  polarization”).  This  is  done  in 
the  next  section.  There,  Section  5.3.4  examines  the  stochastic  problem. 
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5.3.3  A  Close  Look  at  the  New  Foldy-Wouthuysen  Terms 

5.3.3.1  Higher-order  FW  Terms 

As  frequently  discussed  above,  the  FW  procedure  generates  a  Hamiltonian  that  uniquely  includes  a 
series  of  terms  explicitly  associated  with  the  range  dependence.  It  was  noted  in  Section  4.4  that  the  terms 

of  this  series  that  are  proportional  to  X  correspond  to  the  phenomenon  of  “vacuum  polarization”  known 
from  quantum  mechanics  (because  this  type  of  term  involves  virtual  particle  pairs)aaa.  The  lowest-order 

member  of  this  class  of  terms  is  the  third-order  term  j ,  that  was  just  evaluated  in  Section  5.3.2. 

Examining  Eq.  (92),  we  see  that  there  are  also  two  nominally  fourth-order  terms  in  this  class: 
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Below,  we  present  an  estimate  for  the  magnitude  of  these  terms  for  typical  values  of  the  environmental 
parameters,  and  verify  that  they  are  indeed  getting  smaller.  Furthermore,  8  -function  bifurcation  can  be 

used  to  convert  the  —  )  term  into  a  form  that  suggests  an  infinite  series  encountered 

previously.  The  infinite  series  can  be  evaluated  in  closed  form.  This  allows  us  to  examine  the  boundary 
conditions  for  Dirichlet  ( %  —  0 )  and  Neumann  ( d%/dn  =  0  for  the  full  wave  problem,  but  this  changes  a 
little  bit  for  the  PE)  boundary  conditions.  The  Neumann  boundary  condition  for  the  PE  implies  the 
existence  of  curvature-induced  boundary  waves.  Curvature-induced  boundary  waves  have  not  yet  been 
fully  incorporated  into  modern  scattering  theory,  and  the  PE  based  on  the  FW  transformation  presents  a 
promising  new  technique  for  doing  so.  Curvature-induced  boundary  waves  were  studied  by  Biot  and 
Tolstoy  in  a  line  of  development  that  has  until  now  run  parallel  to  mainstream  rough-surface  scattering 
theory.  The  relationship  between  the  Biot-Tolstoy  scattering  theory  and  the  curvature-induced  boundary 
waves  predicted  by  the  parabolic  equation  based  on  the  FW  transformation  is  discussed  at  the  end  of  this 
subsection. 


5.3.3.2  Specific  Examples 

As  in  Section  5.3.2,  we  consider  two-dimensional  x  —  z  space,  and  Regions  I  and  II  separated  by  the 
rough  surface  z  =  /(-*)•  Values  p  and  K  are  constants  in  the  half-spaces,  but  jump  at  the  interface. 

The  reference  compressibility  is  K, ,  and  the  reference  density  is  pu  .  This  implies  that  LI I  =  yu  =  0 .  In 
this  section,  we  choose  the  values 


K**=KI=K0=3Kn=3Kmi 

2/Va.er  =  2P,  =  Po  =  Pi,  =  Ps  and  •  ( 100) 

<Va,er=1500m/s  =>  csand  «  1 840 m/s 

These  parameter  values  are  representative  of  underwater  acoustics  scenarios. 

Comparing  the  fourth-order  term  X]  with  the  third-order  term  —  Xj ( 8k(>  j ,  we  find  terms  that 

differ  from  the  corresponding  (bifurcated)  third-order  contributions  in  one  of  two  basic  ways: 


aaa  An  example  of  a  new  term  introduced  by  the  FW  that  is  not  of  the  “vacuum  polarization”  class  of  terms  would  be 
the  term  given  by  Eq.  (93). 
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■  Terms  that  differ  from  the  corresponding  third-order  term  by  a  factor  ([— 3-&t]/[4^f0])(Vy/fc+  j 
(operating  on  the  wave  function  %± ). 

■  Terms  that  differ  from  the  corresponding  third-order  term  by  a  factor  3  [/l±  —  y±  )  . 

(Recall  that  the  subscript  ±  indicates  that  the  parameters  are  to  be  evaluated  just  inside  Region  I  (+)  or 
Region  II  (-).) 

In  Appendix  N.  1.1,  it  is  argued  that  for  the  parameters  given  in  Eq.  (100),  terms  of  the  first  sort  will 
typically  provide  a  negligible  correction  on  the  order  of  1%  to  2%  while  terms  of  the  second  sort 
introduce  a  still-modest  12.5%  correction. 


In  Appendix  N.1.2,  it  is  established  that  for  the  usual  scenario,  where  yu  =0(i.e.,  the  reference 
density  is  in  Region  II),  8  -function  bifurcation  reduces  the  term  —Ay^X/Sk//^  to 


-Ay 


yx_ 

A 


-2  y, 


X 

sky 


(101) 


Given  the  parameters  listed  in  Eq.  (100),  this  represents  a  50%  correction  to  the  third-order  term 
—  A/S k.1  .  This  error  is  significant  enough  to  cause  concern  that  in  many  practical  instances,  it  may  prove 

necessary  to  use  higher-order  theory  to  correctly  evaluate  this  term.  Therefore,  in  Appendix  N.2,  the 
result  is  extrapolated  to  infinite  orders. 

5.3.3.3  Infinite-order  Theory 

The  extrapolation  is  obtained  as  follows.  The  term  given  in  Eq.  (101)  hints  at  the  sequence  of  terms 


(1  +  2  y+) 


-X 


which  contains  the  familiar  factor  (l  +  2j'+).  This  hints  at  the  beginning  of  a  series  that  plays  a 
prominent  role  in  our  expansion  of  the  PE,  namely  the  series  given  in  Eq.  (96): 
1  +  (2y)  +  (2yy  +  (2y)  =  P0 / p  .  This  suggests  that  at  infinite  order,  we  have  —  [X/S k/)  '{Pii / Pi)  ■ 


Modifying  the  boundary  conditions  accordingly,  and  then  taking  the  limits  pu  »  ps  and  Kn  <SC  Kt , 
we  get  the  Dirichlet  boundary  conditions  for  the  field  in  Region  II  and  the  Neumann  boundary  conditions 
for  the  field  in  Region  Ihbb.  The  details  are  in  Appendix  N.2.  The  Dirichlet  boundary  condition  is 
unchanged,  but  note  what  happens  to  the  Neumann  boundary  condition.  It  becomes 

VTX,=~Xl.  (102) 


bbb  Note  that  these  boundary  conditions  would  apply,  for  example,  if  Region  II  represents  the  water  column  under  an 
air-sea  interface  or  Region  I  represents  the  water  column  above  a  very  hard  bottom.  Note,  this  implies  that  the 
Dirichlet  boundary  condition  would  apply  for  an  acoustic  field  incident  from  a  hard  bottom  onto  the  water  column 
and  the  Neumann  boundary  conditions  would  apply  on  a  sound  field  incident  from  the  air  onto  the  ocean  surface. 
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This  is  significant  because  such  boundary  conditions  allow  solutions  containing  transverse  dependence  of 
the  form  exp(— /  ■  z /4  )  —  i.e.,  boundary  waves.  As  noted  above,  these  curvature -induced  boundary 

waves  have  not  yet  been  fully  incorporated  into  modern  scattering  theory,  and  the  PE  based  on  the  FW 
transformation  presents  a  promising  new  technique  for  doing  so. 

5.3.3.4  Boundary  Waves 

Boundary  waves  have  proven  to  be  quite  problematic  in  scattering  theories  based  on  the  full-wave 
acoustic  equation.  For  example,  they  do  not  show  up  at  low  orders  of  perturbative,  small-slope  or 
Kirchhoff  scattering  theory.  If  they  show  up  at  all,  boundary  wave  phenomena  creep  in  as  multiple 
scattering  effects  at  high  orders  of  these  approximations.  As  Thorsos  and  others  have  pointed  out  [82-85], 
the  fact  that  a  given  order  can  ignore  boundary  waves  is  intimately  connected  to  the  fact  that  the  above 
rough  surface  scattering  theories  do  not  conserve  energy  at  each  order.  A  prominent  example  is  lowest 
order  perturbation  theory  for  the  Neumann  boundary  condition,  which  predicts  a  scattered  field  even  at 
glancing  incidence,  when  the  energy  flux  incident  on  the  surface  goes  to  zero.  The  perturbation,  small- 
slope,  and  Kirchhoff  approximation  (and  others  less  well  known)  are  all  part  of  a  closely  related  family  of 
approximations,  and  therefore  they  share  many  advantages  and  deficiencies,  including  their  problems 
associated  with  boundary  wave  phenomena  such  as  the  lack  of  order-by-order  energy  conservation.  Note 
that  the  PE  has  energy  conservation  built  in,  so  a  rough  surface  scattering  theory  directly  based  on  it 
should  include  the  phenomenon  of  roughness-induced  boundary  waves.  This  is  a  key  distinction  between 
traditional  theories  of  rough  surface  scattering  and  the  energy-conserving  PE,  and  it  is  potentially  a 
fruitful  topic  for  future  research. 

Appendix  O  presents  a  closer  look  at  curvature -induced  boundary  waves  in  the  context  of  modern 
acoustic  scattering  theory  and  points  to  ways  in  which  the  PE  based  on  the  FW  transformation  may 
contribute  to  our  understanding  of  this  issue.  This  appendix  also  notes  that  effects  that  explicitly  depend 
on  the  square  of  the  local  slopeccc  raise  issues  that  are  similar  to  the  ones  posed  by  curvature-induced 
boundary  waves.  Both  effects  are  artificially  pushed  to  high  orders  by  the  various  perturbative 
approximations  in  widespread  use,  and  both  involve  physics  that  is  new  to  the  theoretical  framework  and 
is  uniquely  sensitive  to  the  size  scale  between  the  Bragg  (wavelength)  scale  and  [ wavelength ]/2tt  (this 
is  a  third  scale  in  addition  to  the  large  (>  wavelength)  scale  and  the  Bragg  scale).  The  new  physics  is  a 
consequence  of  the  fact  that  the  curvature  /  and  the  square  of  the  slope  / 2  typically  remain  sensitive  to 
subwavelength  scales,  while  the  surface  displacement  /  does  notddd.  However,  this  sensitivity  to  small 
scales  also  causes  trouble.  Such  terms  fail  to  properly  cut  themselves  off  at  very  small  scales,  and, 
therefore,  they  raise  the  issue  of  renormalization.  The  PE  provides  a  serviceable  solution  to  the 
renormalization  problem:  the  issue  of  renormalization  can  be  sidestepped  with  the  use  of  a  single  upper 
cutoff.  The  advantages  of  the  PE  are,  then,  that  it  brings  curvature  and  tilt-induced  effects  to  their  natural 
place  at  low  order,  and  that  it  presents  a  serviceable  solution  to  the  renormalization  problem. 


ccc  This  refers  to  effects  explicitly  proportional  to  f2  such  as  those  discussed  in  Section  4.2.1,  and  not  to  effects  that 
implicitly  depend  on  the  slope,  such  as  tilting  of  the  diffraction  grating  introduced  by  the  small  slope  approximation 
(see  Appendix  O). 

ddd  This  statement  applies  to  the  rough  surfaces  typically  encountered  by  classical  fields.  As  noted  in  Section  4.4, 
this  does  not  hold  in  the  analogous  problem  from  atomic  physics,  where  even  the  equivalent  of/ becomes  sensitive 
to  small  scales. 
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5.3.3.5  Relationship  to  Other  Approaches  Such  as  Biot-Tolstoy  Theory 

There  is  an  independent  line  of  research  that  shares  many  of  these  advantages.  However,  it  has 
remained  outside  the  mainstream  of  scattering  theory.  M.  A.  Biot  and  I.  Tolstoy  have  developed  a  theory 
for  scattering  from  bossed  surfaces  that  predicts  curvature-induced  boundary  conditions  that  are  very 
similar  to  those  generated  by  the  “vacuum  polarization”  term.  This  work  was  outlined  in  a  series  of 
articles  beginning  in  the  late  1950s.  A  flavor  of  the  effort  is  provided  by  Refs.  86  through  88,  and  a  more 
complete  reference  list  is  available  in  the  annotated  bibliography  at  the  end  of  Ref.  8.  Their  boundary 

conditions  contain  terms  that  are  proportional  to  the  curvature  /  ,  and  resemble  the  O  [  /'  j  contribution 
to  boundary  conditions  on  d%/dz  given  Eq.  (71)  for  a  sound  speed  jump,  Eq.  (99)  for  a  density  and 
compressibility  jump,  and  Eq.  (102)  for  the  Neumann  boundary  condition.  (For  that  matter,  the  O  (/2) 

term  in  the  d%/dz  boundary  term  associated  with  the  sound  speed  cusp  and  given  in  Eq.  (31)  is  similar 
as  well.) 

The  similarities  reflect  profound  links  between  the  results  obtained  using  the  two  very  different 
approaches.  The  physics  of  Biot-Tolstoy  theory  is  basically  the  same  as  that  associated  with  the 
curvature -generated  component  of  the  “vacuum  polarization”  term.  The  dependence  on  the  curvature 
ensures  that  the  associated  physical  effects  are  sensitive  to  a  broad  spectrum  of  wave  numbers  in  the 
rough  surface  spectrum.  Therefore,  via  the  uncertainty  principle,  we  know  that  they  are  also  sensitive  to 
scales  smaller  than  a  wavelength.  The  associated  physical  effects  include  curvature-induced  boundary 
waves  and  other  emergent  multiple  scattering  phenomena.  Although  these  effects  are  obscured  by 
perturbation  theory,  they  are  not  vanishingly  small.  Biot-Tolstoy  boundary  waves  have  been  observed  in 
the  laboratory  [89-90]. 

There  are  also  several  differences  between  the  “vacuum  polarization”  contribution  to  the  PE  and  Biot- 
Tolstoy  scattering.  The  PE  deals  with  both  cusps  and  jumps  in  the  environmental  parameters,  while  Biot- 
Tolstoy  theory  only  handles  jumps.  The  remaining  differences  between  the  two  approaches  reflect  the 
different  ways  that  the  rough  surface  is  constructed.  Biot-Tolstoy  theory  constructs  the  surface  using 
bosses  (hemispheres,  sections  of  hemispheres,  and  indentations  of  the  same  shape).  Generally  speaking, 
Biot-Tolstoy  theory  allows  for  surfaces  that  are  rougher  than  those  allowed  by  the  PE,  which  only  allows 
surfaces  that  are  smooth  displacements  from  the  range -independent  problem  and  have  modest  slope  and 
curvature.  The  PE  thus  operates  towards  the  smooth-surface  end  of  Biot-Tolstoy  theory.  To  use  Biot  and 
Tolstoy’s  terminology,  the  parabolic  equation  is  restricted  to  surfaces  with  small  form  factors.  Thus,  it 
can  only  generate  boundary  waves  along  concave  deterministic  surfaces  (associated  with  parameter 
jumps)  or  along  cusps.  Bossed  surfaces  of  the  type  addressed  by  Biot-Tolstoy  theory  can  have  non-zero 
average  curvature,  and  so  boundary  waves  are  possible  even  in  the  stochastic  problem.  On  the  other 
hand,  the  parabolic  equation  deals  directly  with  the  surface  relief  function  /,  and  so  it  can  easily  handle 
relatively  smooth  deterministic  surfaces.  More  generally,  there  are  a  number  of  complications  associated 
with  decomposing  surfaces  into  bosses,  and  these  are  avoided  by  the  PE. 

Finally,  we  should  also  note  that  Barbone  and  Spivak  have  also  combined  effective  impedance 
boundary  conditions  (vaguely  like  Eq.  (102)  above)  with  the  PE  to  model  scattering  from  the  ocean 
bottom  [91].  Here  acoustic  scattering  from  the  bottom  is  modeled  using  an  apparent  impedance  that  is 
treated  as  a  perturbation  from  the  boundary  conditions  for  a  rough  pressure-release  (i.e.,  Dirichlet 
boundary  condition)  surface.  The  physics  here  is  not  the  same  as  the  phenomena  described  by  the 
“vacuum  polarization”  term  in  the  PE. 

Next,  let  us  consider  the  stochastic  problem  for  the  PE  where  the  density  and  compressibility  both 
jump. 
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5.3.4  The  Classical  Lamb  Shift  for  Sp  4  0 

This  section  examines  the  stochastic  boundary  conditions  corresponding  to  a  field  that  obeys  the 
boundary  conditions  shown  in  Eq.  (99)  in  the  deterministic  case.  All  the  assumptions  described  by  Fig. 
17  must,  of  course,  continue  to  hold  here.  Specifically,  assume  that  Kr  >  Ku  and  p:  <  pn  ,  and  that  the 
reference  values  are  chosen  to  be  the  larger  value.  As  in  Sections  3.2.3  and  4.3,  adapt  the  technique 
originally  developed  by  Kuperman  [65].  Project  the  boundary  conditions  at  Z  =  f  down  to  the  line 
Z  =  0.  Then  break  the  wave  function  evaluated  at  z  =  0  into  coherent  and  incoherent  parts: 
X\__ 0  =  (z)(0)  +  '  and  average  these  boundary  conditions. 

The  stochastic  version  of  the  boundary  conditions  shown  in  Eq.  (99)  turns  out  to  be: 
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(103) 


z= 0 


As  always,  an  overbar  indicates  an  averaging  between  the  values  from  Regions  I  and  II.  Note  that  if 
we  set  Pj  =  pn  ,  Sp  =  0  and  SK/  K;  =  —2 pu  ,  we  recover  Eq.  (81).  Recall  that  as  always,  the  surface 

spectrum  used  to  calculated  / f and  \fS%\  cuts  off  at  k0  =  yj Kjpn  .  The  details  of  the  derivation  of 
Eq.  (103)  are  presented  in  Appendix  P. 

In  the  d%/dz  boundary  condition  of  Eq.  (103),  there  is  an  apparent  “vacuum  polarization”  term  (i.e., 
downrange  Lamb  shift): 


Sp 

1  d3{x) 

8  K 

p-  dz3 

This  time,  unlike  for  the  sound  speed  cusp  and  our  “toy  model”  for  the  hydrogen  atom  (Eqs.  (40)  and 
(51),  respectively),  this  “vacuum  polarization”  contribution  has  the  same  sign  as  has  the  “traditional”  (or 
transverse)  Lamb  shift  (caused  by  smearing).  That  is  because  this  term  is  generated  by  a  “hole.”  In  other 
words,  it  results  from  the  absence  of  a  term  rather  than  from  the  active  presence  of  a  term.  The  term  that 
is  “missing”  from  the  deterministic  boundary  conditions  (Eq.  (99))  is 
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This  is  the  first  time  that  we  have  encountered  a  “hole”  like  this,  and  for  the  time  being,  the  result  will  be 
treated  as  tentative.  These  issues  are  discussed  further  in  Section  P.2. 

Appendix  P.3  discusses  the  stepping  algorithm  for  the  discretization  of  the  stochastic  problem 
characterized  by  the  stochastic  boundary  conditions  shown  in  Eq.  (103).  The  complications  discussed 
there  are  all  related  to  the  boundary  conditions  involving  the  second  and  third  derivatives  of  the  stochastic 
field  .  To  get  a  quick  and  dirty  idea  of  the  magnitude  of  the  classical  (acoustic)  Lamb  shift  in  the 
context  of  shallow  water  acoustics,  these  problematic  terms  are  dropped  in  the  calculation  in  Section  5.4. 
For  shallow  grazing  angles,  these  higher-order  transverse  derivatives  of  the  field  are  small  and  this 
approximation  is  perfectly  legitimate.  The  remaining  boundary  conditions  still  conserve  energy. 

5.4  The  Significance  of  the  Classical  Lamb  Shift  in  Underwater  Acoustics 

5.4.1  The  Basic  Formalism 

In  this  section,  we  consider  long-range  ducted  propagation  in  a  realistic  shallow-water  environment. 
The  depth  of  the  water  column  is  on  the  order  of  100  m  and  the  penetrable  bottom  is  composed  of  rough 
sand,  mud,  or  soft  fluid-like  rock. 

In  many  such  shallow-water  scenarios  there  are  also  sound  speed  gradients  in  the  water  column  and  in 
the  bottom,  and  these  gradients  are  not  the  same  in  magnitude.  This  generates  a  sound  speed  cusp  along 
the  seafloor.  Furthermore,  the  gradients  in  the  sound  speed  may  even  experience  discontinuities  within  the 
media,  and  these  too  may  occur  along  rough  interfaces.  Thus,  both  the  surface  and  cusp  Lamb  shifts  are 
present.  To  describe  downrange  propagation  under  such  a  scenario,  we  employ  the  quasi-first  order 
stochastic  variable  density  result  with  the  sound  speed  cusp  included.  Note  that  Eq.  (93),  which  adds  a 
term  associated  with  a  variable  sound  speed  in  the  half-spaces,  is  nominally  fourth  order,  while  the  term 

proportional  to  A  that  is  present  in  quasi-first  order  theory  is  nominally  third  order.  Thus,  to  obtain  quasi- 
first  order  theory  where  there  are  parameter  jumps  and  sound  speed  cusps,  we  should  simply  combine  the 
boundary  conditions  shown  in  Eqs.  (40)  and  (103).  In  order  to  put  the  results  into  a  form  suited  for 
numerical  studies,  we  use  boundary  conditions  Eqs.  ((P.14)  and  (P.20))  in  lieu  of  those  in  Eq.  (103). 
These  alternate  boundary  conditions  introduce  the  variables  A  and  B  : 
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where 
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As  in  Section  4.3  (specifically,  the  discussion  leading  to  Eq.  (82)),  we  now  drop  the  terms  associated  with 
diffuse  (Bragg)  scattering.  Furthermore,  as  discussed  at  the  end  of  Section  5.3.4,  we  also  drop  the  terms 

proportional  to  the  second  and  third  derivatives  of  the  stochastic  field  (^%)  -The  latter  is  a  good 

approximation  for  shallow  grazing  angles.  Here  in  Section  5.4,  as  we  are  considering  long-range  ducted 
propagation  in  a  shallow-water  environment,  mode-stripping  rapidly  eliminates  any  high-grazing  angle 
modes,  leaving  us  only  with  shallow-grazing  angle  modes.  Thus,  in  the  present  context  it  is  perfectly 
legitimate  to  eliminate  the  wide-angle  terms  proportional  to  second  and  third  derivatives  of  the  stochastic 

field  (x)  ■  The  result  of  these  approximations  is 
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Next,  let  us  relate  this  result  to  the  acoustic  pressure  field.  Let  us  define  a  new  field  w 
Now 


(sfHj'.H  =  (JWf'.jHsfH 


=  4h  ~4h  =  o. 


and  so  w  obeys  the  same  wave  equation  and  consequently  the  same  boundary  conditions  as  does  X  ■ 
Also  note  that  in  the  half-spaces,  we  have 


Thus,  with  the  usual  choice  p0  =  pu  ,  in  Region  I  we  have 


and  in  Region  II,  wu  =  Au  . 

yfpjJi  : 


Now,  let  us  define  a  new  field  by  multiplying  w  by  the  constant 
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The  field  w  obeys  the  same  PE  and  the  same  boundary  conditions  as  w  and  X  >  and  in  Region  I  it  is 
equal  to  the  pressure  field:  wf  =  Aj .  Taking  stochastic  averages,  we  now  have  (w/)  =  (A/),  where 
obeys  boundary  conditions  (Eq.  (105)).  This  effectively  means  that  we  are  free  to  apply  boundary 
conditions  (Eq.  (105))  directly  to  the  stochastic  acoustic  field  ^A^  in  Region  1  (the  water  column).  In 
other  words,  given  the  initial  acoustic  field  in  Region  I,  we  implicitly  convert  to  (wI )  ,  propagate  and  then 
perform  the  (trivial)  conversion  back  to  the  acoustic  field  ( A,  ^ .  To  perform  the  conversion  from  (  w/f ) 
to  (An)  ,  we  would  have  to  multiply  by  ^ pn  / p,  ,  but  we  will  not  need  this  at  present. 

Boundary  conditions  (Eq.  (105))  apply  to  a  stochastic  quasi-planar  rough  surface.  It  is  important  to 
appreciate  that  there  is  a  fundamental  difference  between  scattering  from  this  kind  of  rough  interface  and 
a  simpler  problem  such  as,  for  example,  propagation  through  a  simple  penetrable  wedge.  A  theory 
describing  the  wedge  problem  can  be  checked  against  an  analytical  solution.  On  the  other  hand,  recalling 
the  discussion  in  Section  2.3,  we  know  that  theories  describing  rough  surface  scattering  phenomena,  such 
as  diffuse  (Bragg)  scattering  and  the  Lamb  shift,  cannot  be  similarly  benchmarked  against  closed-form 
solutions.  Like  chaos,  these  phenomena  appear  when  analytical  solutions  are  not  available.  The  validity  of 
the  theories  and  numerical  implementations  related  to  such  phenomena  must  somehow  be  evaluated  by 
comparing  existing  stochastic  data  sets  (taken  either  in  the  field  or  in  the  laboratory)  to  ensemble  averages 
of  deterministic  calculations,  or  (a  little  more  easily)  to  stochastic  theories.  The  classical  Lamb  shift  given 
by  boundary  conditions  (Eq.  (105))  is  a  stochastic  result  that  can  be  used  in  this  manner.  Thus,  it  is  useful 
to  employ  boundary  conditions  (Eq.  (105))  to  simulate  this  phenomenon  for  typical  values  of  the  relevant 
environmental  parameters,  and  then  examine  the  feasibility  of  using  experimental  results  to  validate  the 
theory. 

5.4.2  A  Numerical  Study 

Ralph  Baer  of  NRL  has  conducted  such  a  study.  His  calculations  are  based  on  the  NRL  Range- 
dependent  Acoustic  Model  (RAM)  PE  code  [92],  a  standard  PE  code  in  widespread  use  today.  Baer 
incorporated  boundary  conditions  (Eq.  (105))  into  this  code,  and  used  the  result  to  simulate  the  classical 
Lamb  shift  for  a  realistic  shallow  water  environment.  The  results  are  presented  in  Fig.  18.  In  Fig.  18(a), 
we  show  a  typical  shallow  water  sound  speed  profile,  where  a  cusp  (i.e.,  a  knee  in  the  sound  speed 
profile)  is  advected  by  internal  waves,  and  the  interface  between  the  water  (light  gray  area)  and  the 
bottom  (dark  grey  area)  is  stochastically  rough.  A  typical  water-column  depth  is  100  m.  The  internal 
waves  are  given  stochastically  by  the  Garret-Munk  spectrum,  and  the  bottom  roughness  by  a  spectrum 
typically  taken  from  the  work  of  Darrell  Jackson  [93-96]  or  Essen  [97].  In  Fig.  18(b),  the  transmission 
loss  (TL)  as  a  function  of  range  is  graphed  for  a  simple  example.  The  solid  line  corresponds  to  the 
traditional  theory,  while  the  dashed  line  is  the  result  when  the  classical  Lamb  shift  is  included.  The  range 
shown  varies  from  24  to  25  km.  The  source  and  receiver  depths  are  both  50  m  —  i.e.,  midway  in  the 
water  column.  In  this  particular  example,  the  sound  speed  varies  linearly  from  1500  m/s  at  the  flat  air-sea 
boundary  to  1530  m/s  at  the  rough  bottom.  The  bottom  roughness  is  characterized  by  the  two- 

dimensional  spectrum  S  (kj  =  (0.002  m4  j  ■  (h()k )  5  (with  h0  =  1  ),  which  is  then  used  to  calculate 
and  ^/“^ .  These  values  are  then  brought  over  to  the  one-dimensional  problem.  The  peak  of  the 
bottom  spectrum  kp  is  assumed  to  be  kp  =  0. 1  m  (corresponding  to  a  period  in  the  undulations  along 
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Transmission  loss  with  and  without  the  acoustic  Lamb  shift 


Key  input  parameters: 

•  average  sound-speed  profile;  density  in  bottom 

•  bottom  interface  roughness  spectra 

•  internal  wave  spectra 


(a) 


Fig.  18  —  (a)  A  typical  shallow  water  sound  speed  profile,  where  a  cusp  (i.e.,  a  knee  in  the  sound  speed  profile)  is 
advected  by  internal  waves,  and  the  interface  between  the  water  (light  gray  area)  and  the  bottom  (dark  gray  area)  is 
stochastically  rough,  (b)  The  transmission  loss  as  a  function  of  the  range  is  graphed  for  a  simple  example  in  a 
realistic  shallow  water  environment:  a  100  m-deep  water  column  over  a  rough  sandy,  muddy,  or  fluid-like  rock 
bottom.  Typical  values  are  used  for  the  various  environmental  parameters  involved.  (The  precise  values  are  laid  out 
in  the  text.)  The  solid  line  corresponds  to  the  traditional  theory,  while  the  dashed  line  is  the  result  when  the  classical 
Lamb  shift  is  included.  For  this  example,  note  that  there  is  no  cusp  advected  by  internal  waves;  but  there  is  a  cusp  at 
the  rough  bottom.  Note  that  the  classical  Lamb  shift  illustrated  above  resembles  a  spatial  shift.  The  actual  Lamb 
shift  is  in  the  downrange  wave  number,  and  after  downrange  propagation,  this  translates  approximately  into  a  spatial 
shift  (the  different  modes  are  shifted  by  different  distances,  and  so  overall  there  is  a  little  more  going  on  than  a 
simple  spatial  shift).  When  mode  stripping  eliminates  all  but  a  very  small  number  of  modes,  this  spatial  shift 
becomes  clearly  defined  as  seen  above.  Ralph  Baer  performed  this  calculation. 


the  bottom  on  the  order  of  60  m).  The  sound  speed  in  the  bottom  is  a  constant  2000  m/s,  and  the  density 
in  the  bottom  is  1.5  times  the  density  of  water.  For  this  particular  example,  note  that  there  is  no  cusp 
advected  by  internal  waves;  but  there  is  a  cusp  at  the  rough  bottom. 

5.4.3  The  Significance  of  the  Results 

The  Lamb  shift  appears  for  all  types  of  rough  interfaces,  and  in  principle  its  effect  can  be  quite  large. 
Examining  Fig.  18(b)  it  is  immediately  apparent  that  the  classical  Lamb  shift,  at  least  in  this  typical  case, 
represents  a  spatial  shift.  As  discussed  in  the  figure  caption,  this  is  no  surprise  since  the  actual  shift  is  in 
the  downrange  wave  numbers,  and  after  propagation  this  translates  into  a  shift  in  the  pattern  of  the 
transmission  loss  as  a  function  of  the  range.  Unfortunately,  in  underwater  acoustics,  such  a  shift  may  be 
relatively  difficult  to  measure  since  both  the  field  and  the  environment  are  usually  sampled  only 
intermittently.  The  best  way  to  measure  the  acoustic  Lamb  shift  would  be  to  isolate  modes  and  look  for 
roughness-induced  changes  to  the  beat  frequency  between  pairs  of  low  modes.  This  effect  could  be  most 
easily  measured  in  the  laboratory.  Such  an  experiment  would  certainly  be  instructive  and  have  scientific 
value,  but  from  the  point  of  view  of  systems  development,  it  may  not  be  of  crucial  importance,  since  the 
directly  analogous  phenomenon  has  been  measured  in  the  quantum  problem.  On  the  other  hand,  interface 
waves  (i.e.,  localization)  associated  with  the  Lamb-shift  effect  near  (stochastically  advected)  cusps  could 
have  significant  practical  applications,  and  the  full  development  of  our  understanding  of  such  phenomena 
will  require  further  study. 
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Furthermore,  as  noted  at  the  end  of  Section  5.3.3,  the  curvature-induced  (for  a  parameter  jump)  and 
tilt-induced  (for  a  cusp)  boundary  waves  that  we  have  found  in  the  deterministic  problem  are  related  to 
acoustic  Biot-Tolstoy  boundary  waves  and  these  have  also  been  observed  in  the  laboratory  [86-90].  This 
effect  is  probably  quite  small,  but  this  assertion  also  requires  further  study. 

In  the  context  of  underwater  acoustics,  an  additional  value  of  considering  the  acoustic  Lamb  shift  lies 
in  the  role  it  plays  as  a  thought  experiment  that  establishes  the  validity  of  the  techniques  being  employed. 
Detailed  experiments  on  the  hydrogen  atom  have  established  the  atomic  Lamb  shift  as  a  bona  fide 
physical  effect.  This  experience  teaches  us  that  the  8  -functions  spawned  by  the  PE  derived  using  the 
FW  transformation  are  associated  with  real  physical  effects  rather  than  being  unfortunate  artifacts  that 
compromise  the  PE. 

Significantly,  our  study  of  the  Lamb  shift  also  exposes  the  role  that  these  contact  potentials  can  play 
specifically  in  the  context  of  multiscale  rough  surface  scattering.  As  discussed  in  Section  4.4,  the  Lamb 
shift  phenomena  associated  with  the  hydrogen  atom  and  a  sound  speed  cusp  —  and  from  Eq.  (103) 
apparently  also  that  associated  with  a  density  jump  —  all  include  both  a  dominant  contribution  due  to 

“smearing”  (i.e.,  the  “transverse  Lamb  shift  proportional  to  A' 2  ^ / 2  ^ )  and  a  vacuum  polarization 

correction  (i.e.,  the  “downrange  Lamb  shift”  proportional  to  {^fj  ).  (In  the  hydrogen  atom,  there  is  also  a 

third  contribution  associated  with  an  anomalous  magnetic  moment,  but  this  does  not  occur  in  acoustics.) 
The  vacuum  polarization  effect  in  particular  is  a  true  "broad  spectrum”  rough  surface  effect  (i.e.,  one  that 
involves  all  scales  from  the  largest  down  to  the  limits  of  the  uncertainty  principle  —  about  1/12  of  a 
wavelength).  Thus,  our  consideration  of  the  Lamb  shift,  both  atomic  and  acoustic,  tells  us  that  the  8  - 
functions  generated  by  the  FW  transformation  play  an  indispensable  role  in  bringing  together  the  many 
scales  involved  in  the  full  rough  surface  scattering  problem.  The  techniques  that  accurately  describe  this 
effect  can  then  be  used  with  confidence  to  model  more  important  aspects  of  the  rough  surface  scattering 
problem.  We  will  shortly  see  that  introducing  a  density  jump  leads  to  new  FW-generated  8  -functions 
(i.e.,  boundary  conditions)  that  are  directly  relevant  to  the  problem  of  underwater  propagation  near  rough 
ocean  bottoms. 

6.  BRAGG-SCALE  VORTICITY 

This  section  addresses  an  important  phenomenon  associated  with  density  jumps  —  Bragg-scale 
vorticity.  This  effect  is  much  more  important  than  the  classical  Lamb  shift  (let  alone  its  small  “vacuum 
polarization”  component).  In  Section  6.1,  the  effect  is  defined,  and  then  brought  into  the  PE  formalism. 
Although  Bragg-scale  vorticity  enters  the  acoustic  problem  in  a  way  that  evokes  no  direct  analogies  from 
atomic  physics,  the  mathematical  formalism  built  up  to  this  point  imposes  a  specific  procedure  for 
incorporating  this  effect.  The  effects  associated  with  Bragg-scale  vorticity  emerge  naturally  when  the 
high-order  PE  generated  by  the  FW  procedure  is  applied  to  an  interface  where  the  density  jumps.  The 
second-order  PE  at  the  interface  is  analyzed  below,  and  the  subtle  mechanics  associated  with  the 
formalism  are  discussed.  The  result  is  then  placed  into  the  context  of  current  PE  techniques  in  Section 
6.2.  Section  7  adapts  the  formal  approach  developed  for  the  acoustic  field  with  varying  density  to  the 
electromagnetic  and  elastodynamic  problems. 

6.1  The  O  (A2 )  Parabolic  Equation  at  an  Interface  Where  the  Density  Jumps 

In  Section  6.1.1,  we  consider  the  range-independent  problem  and  identify  a  discontinuity  in  the 
transverse  component  of  the  energy  flux  that  occurs  when  the  density  jumps  along  a  horizontal  interface, 
but  is  not  included  in  the  first-order  theory  developed  in  Sections  5.3.1  and  5.3.2.  At  the  conclusion  of 
this  section,  we  note  that  higher  orders  of  the  theory  must  somehow  account  for  this  missing  effect.  This 
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is  quite  subtle,  so  Section  6. 1 .2  opens  the  discussion  with  an  intuitive  overview  of  how  the  formalism 
goes  about  constructing  a  solution.  Section  6.1.3  supplements  this  with  a  discussion  of  the  physical 
significance  of  our  formal  results.  This  section  identifies  the  underlying  physical  effect,  Bragg-scale 
vorticity.  This  has  prepared  us  for  Section  6.1.4,  a  systematic  formal  treatment  of  the  second-order 

( O  (A2 ) )  range-dependent  problem  where  the  density  jumps.  The  resultant  boundary  conditions  given  in 

Eq.  (120)  are  the  most  important  results  of  this  report.  Section  6.1.4  also  examines  how  the  new  physical 
effect,  Bragg-scale  vorticity,  appears  in  the  formalism.  It  also  discusses  how  a  step  function  in  the 
downrange  flux  can  gradually  materialize  as  higher  orders  are  added.  This  includes  a  discussion  of  the 
role  played  by  noncommutivity  relations  that  enable  the  migration  of  boundary  conditions.  Section  6.1.5 
addresses  the  formal  role  that  the  “vacuum  polarization”  terms  play  in  recreating  the  full  wave  problem, 
where  the  density  jumps.  Section  6.1.6  underlines  the  distinction  between  the  two  effects  newly 
incorporated  by  the  FW  procedure:  Bragg-scale  vorticity  and  “vacuum  polarization.” 

6.1.1  The  Basic  Issue 


In  Section  5.4,  as  our  examination  of  the  classical  Lamb  shift  neared  its  conclusion,  we  touched  upon  a 
result  that  holds  great  significance  for  our  primary  goals.  These  goals  were  outlined  in  Section  2.3:  first 
to  understand  why  imperfect  ad  hoc  solutions  have  until  now  proven  to  be  the  only  way  to  apply  the  PE 
formalism  to  a  range-dependent  interface  where  the  density  jumps,  and  second  to  use  this  understanding 
to  systematically  improve  upon  the  current  standard  PE  techniques.  The  achievement  of  these  goals  is 
needed  if  we  are  to  apply  the  PE  to  multiscale  penetrable  rough  surfaces  where  the  density  jumps,  which 
is  the  ultimate  goal  that  is  of  the  most  immediate  practical  importance. 


To  be  specific,  new  insights  that  advance  these  goals  emerge  from  an  expanded  consideration  of  the 


new  function  briefly  examined  in  Section  5.4:  w  = 


As  in  that  context,  for  the  moment  we 


consider  the  range-independent  problem,  and  assume  that  the  density  jumps  along  a  horizontal  line  (e.g., 

Z  =  0),  but  is  (as  always  in  our  current  study)  constant  in  the  half-spaces.  Recall  that  JTl  commutes 
with  the  Hamiltonian  H  ,  and  so  in  the  range-independent  problem,  w  obeys  the  same  wave  equation, 
and  consequently  the  same  boundary  conditions  as  %  itself.  In  simple  0(A)  theory,  we  have  continuity 

of  X  and  of  Yp  •  dx/z  and  the  same  for  w  and  /p-dYz  ■  As  noted  in  Section  5.4,  in  the  half-spaces,  we 
have 


and  with  the  usual  choice  p0  =  pu  ,  in  Region  I  we  have 

•vv,  =  A,,  (106) 

and  in  Region  II,  wn  =  An  .  Substituting  these  results  into  the  continuity  condition  for  w  gives  us 

L/  =  AII  ■ 
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With  y/Pjj/Pj 


Ai  -  A„ 


■  0  along  the  interface  (i.e.,  the  z  =  0  line),  we  have 


d_ 

dx 


dAn 

dx 


using  the  fact  that  Jpuf  Pj 
is  simply  a  constant 


along  the  interface,  and  so  continuity  of  ■  dA/dx  .  Now  consider  the  continuity  condition  on  x/p  ■  : 


1  dwj  _  1  dwn 
Pi  pn  ri¬ 


ll  07) 


Once  again,  since  \j  pn  /  p,  is  just  a  constant,  we  can  use  Eq.  (106)  to  substitute  ^ pu  / pI  ■  A,  for  w,  in 
Eq.  (107).  Similarly,  substitute  An  for  wn  .  This  gives  us 


or  1  dA,  _  1  dAn 
pf  dz  p„  dz  pf  yOjf 

Combining  results,  we  have 

A  1  dA  1  dA 

— , — — —  all  continuous. 

fp  sip  a*  A  3: 


(108) 


In  Appendix  Q.2,  these  results  are  obtained  using  a  variation  on  the  above  argument. 

Now,  let  us  consider  the  time-averaged  energy  flux  vector  [98,  Eqs.  (64.5  and  64.5)]  and  use  a  well 
known  result  for  the  time  averaged  product  of  the  real  parts  of  two  complex  fields;  also  see  Eq.  (C.12): 


S . = 


Im^A*  VA) 


2  pco 


(109) 


Combining  Eqs.  (108)  and  (109),  the  PE  gives  us  the  boundary  condition  on  the  flux: 


AdA  S  A  dA 

5 - ,— - - - all  continuous  . 

p  dx  p  p-  dz 


Now,  let  us  compare  this  with  the  full  wave  result.  In  this  case,  we  have 


dA  1  9A  .  dA  A  dA 

A, — , - all  continuous  =>  pSx~A — - all  continuous  . 

dx  p  dz  dx  p  dz 

Thus,  although  the  O  ( ?i)  (first  order)  PE  conserves  energy  (the  Hamiltonian  is  after  all  Hermitian),  the 

horizontal  and  vertical  fluxes  appear  to  be  redistributed  relative  to  the  full  wave  result.  On  the  other  hand, 
as  discussed  in  Section  2.3,  it  should  be  possible  to  construct  a  downrange  stepping  algorithm  that 
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generates  a  field  that  can  be  made  arbitrarily  close  to  the  full-wave  solution.  Somehow,  as  we  add  higher 
orders,  the  correct  behavior  known  from  the  full-wave  result  must  reappear. 

6.1.2  A  Heuristic  Approach 

Let  us  now  proceed  to  get  a  handle  on  how  all  this  might  fit  into  the  formalism.  First  we  will  examine 
a  heuristic  presentation  drawing  in  aspects  of  the  structure  suggested  by  the  FW  transformation  as  needed. 
Then  Section  6.1.3  will  supplement  these  new  formal  insights  with  a  look  at  the  underlying  physical 
processes,  and  subsequently  in  Section  6.1.4  we  will  move  on  to  a  more  systematic  discussion  that  builds 
up  directly  from  the  formal  structure  generated  by  the  FW  transformation. 


6.1.2.1  The  Contact  Potential  Needed  to  Produce  the  Correct  Jump  in  the  Downrange  Flux 


The  heuristic  discussion  echoes  Appendix  Q.2.  To  begin  with,  we  follow  the  discussion  surrounding 
form  (Q.  10)  and  (Q.  11)  in  Appendix  Q.2.  To  keep  our  heuristic  discussion  as  clear  as  possible,  let  us 
maintain  for  one  more  section  the  assumption  that  the  jump  in  the  density  occurs  along  the  range- 
independent  line  z  —  0.  Then  add  to  the  first  order  Hamiltonian  H  =  k0  —  2  yk0  +  X  an  extra  (energy 


conserving)  term666 


1 

2 K 


—  Y,S'{z)% 
P 


_  Pn  ~P,  _  &P_ 
2  Pn  2  Pn 


(HO) 


(For  the  moment,  take  the  expression  for  yI  as  a  definition,  but  also  keep  in  mind  that  the  expression  for 
Yj  given  in  Eq.  (110)  is  the  same  as  that  obtained  if  we  set  the  reference  density  to  be  the  density  in 
Region  II  —  pn  ,  and  then  take  the  definition  for  y  first  given  in  Eq.  (87)  (Section  5.1),  and  evaluate  it  in 
Region  I  where  the  local  density  p  is  the  constant  pl .  This  result  is,  of  course,  anticipated  when  we  use 
the  label  yt .) 

Recall  (again  using  the  definition  first  given  in  Eq.  (87)) 


2 


V 

- - - y  +  terms  that  have  no  ^-functions 

2k0 


Pv  VrJf  ! 
P  2  k0 


” 

/  \ 

vr| 

Po 

{ p). 

Vts if  , 
2  k0 


The  second  term  on  the  right-hand  side  of  the  second  line  contains  a  simple  8  -function  that  will  not 
survive  an  indefinite  integration  followed  by  an  infinitesimal  integration666.  Thus  we  drop  this  term  as 
well  as  terms  that  contain  no  8  -functions.  Now,  we  multiply  through  by  2 kQ  •(/?//?„)  and  find  that  the 
key  terms  that  generate  the  boundary  condition  on  the  wavefunction  X  itself  are 


Vlx-7i8'(z)x  +  ---- 


eee  Implicitly,  we  are  again  either  truncating  at  first  order  or  bifurcating  8  -functions  in  order  to  make  sense  of  terms 
that  contain  8  -functions  times  field  quantities  that  are  discontinuous. 


or 

yfPiZi  =  yfftiZn  +  T&  order  >  ( 1 1 1 ) 

which  is  indeed  the  boundary  condition  we  would  expect  for  a  quantity  %  that  is  roughly  the  square  root 
of  the  component  of  the  energy  flux  that  is  transverse  to  the  interface. 

Thus  the  term  given  by  Eq.  (110)  will  “fix”  first-order  theory.  However,  we  cannot  simply  insert  it  into 
our  formalism  by  hand.  We  have  no  control  over  what  such  a  term  does  to  the  boundary  condition  on 
d%/dz.  or  on  the  other  boundary  conditions  (i.e.,  conditions  on  Vnr%\n  >  2  that  are  all  implicit  in  first- 
order  theory).  We  therefore  want  the  term  shown  in  Eq.  (110)  to  emerge  naturally  from  our  theory. 
Since  this  term  needs  to  be  present  even  in  the  range -independent  problem,  it  cannot  emerge  from  a 

“vacuum  polarization”  term  generated  by  the  FW  transformation  such  as  —  X/Sk 2  .  We  must  look  for  it 
in  terms  that  are  higher  orders  in  A ,  say  in  the  term  proportional  to  A2 .  However,  note  that  in  the  context 
of  0(/l2)  theory  (with  lead  order  term  proportional  to  V^X ),  the  term  in  Eq.  (110)  will  have  migrated 

from  being  a  boundary  condition  on  X  itself  to  being  a  boundary  condition  on  d2%/dz  ■  This  has  an 
important  side  benefit.  Actual  jumps  in  the  wavefunction  cause  problems,  because  then  the  x  -derivative 
of  X  would  spawn  a  8  -function,  and  this  in  turn  would  lead  to  a  very  problematic  feedback  loop  in  our 
boundary  conditions.  This  problem  is  eliminated  when  the  term  in  Eq.  (110)  is  intrinsically  embedded  in 

°X)- 

6. 1.2. 2  ()(/l2  j  Theory  Generates  the  Needed  Contact  Potential 

The  term  in  Eq.  (110)  will  need  to  emerge  from  cross-terms  in  X1 .  The  O  (A2  j  Hamiltonian  is 

2  K 


H  =  k0  ~2yk0  +  A 
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Now,  let  us  echo  the  analysis  that  begins  with  the  discussion  surrounding  Eqs.  (Q.20)  and  (Q.21),  and 
continues  to  the  end  of  Appendix  Q.2.  The  leading-order  derivative  comes  from  —  X1  /lk0  : 


f  i  Yf  i  A  ( n  ) 

~r-/2k0=-  —  -  vr  ^  v-  ^  vr+- 

[p)  { p) 


Now,  consider  the  two  cross-terms  in  —X’jl k0  that  will  generate  £>' -functions: 

(-i /2h)^P^-(hr)z 


(-i/2t,)Vr^Vr  (~Ku)x- 


The  term  in  Eq.  (110)  will  have  to  emerge  from  these  terms. 

Let  us  examine  the  second  term.  With  our  usual  conventions  that  K0  =  K l  and  p0  =  pu  (useful  if, 
for  example,  medium  I  is  water  and  medium  II  is  mud),  we  find 


1 

r  \ 

_  l 

\  K) 

1 - 

1 - 

2 

Kn 

v  0  y 

~2 

and  so  in  Region  I,  p  —  0  and  in  Region  II, 


K.-K..  SK 

P  =  — - -  = - =  Pi,  > 

2  K,  2K, 


pn=Ul-X**L 

2{  K,pn 


\  2 

=  -  i_£o 

T  1  2 

2  c„ 


Thus,  with  the  positive  z  -axis  pointing  into  Region  I 

-tj-‘  =  -Le(-z) 


-L0(-Z)  i-% 

2  c„ 


2  V  cii  Pu 


=  -^e(-z)  i- 


(where  we  also  used  the  second  part  of  Eq.  (110)  to  obtain  the  last  equality).  We  also  have 

p\-Pu~P  _\SpAP"  =  7i  in  Region  I 
2^  pQ  J  2 pn  [  0  in  Region  II 

and  so 

r=rMz).  (ii6) 

Combining  Eqs.  (115)  and  (116),  we  have 

- k0p  +  k0y  =  -k0Q(-z)pMa  +  k0r,[Q(z)-Q{-z )]  (117) 

at  first  order  for  our  standard  example  K0  =  K ,  and  p{]  =  pu  . 

Next,  we  demonstrate  that  the  second  term  in  Eq.  (117)  will  lead  to  the  term  Eq.  (1 10).  From  Eqs. 
(112)  and  (113),  we  see  that  Eq.  (117)  gets  operated  on  by 

OXiMZl 

v  2&0  y 
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to  form 


V  2k0  j 


vr(^)vr 


2  K 


\~Kli  +  kj)  = 


'__r 

Vr(%Ar, 

V  ^0  J 

/ _ \_ 

2  K  1 

V  2k0, 

2  k0  [ 

(-ko0(-z)//total) 

(^[0(z)-0(-z)]). 


(118) 


Now  it  is  the  second  term  on  the  right-hand  side  of  Eq.  (118)  that  will  reproduce  the  term  shown  in  Eq. 
(1 10).  This  term  becomes 


2 K  p  2 kQ  oTl  {  dz2 


920(-z) 

dz2 


+ 


terns  that  will  not  survive  a 
double  infinitesimal  integration 


Note  that  Vr  operating  on  1/ p  generates  a  simple  8  -function,  which  will  not  survive  a  double 
integration  across  an  infinitesimal  integral,  and  so  this  operation  has  been  dropped  from  the  present 
calculation.  Finally  use 


320(z) 

dz2 


S'(z) 


-S'(-z) 


320(-z) 

dz2 


to  recover  the  term  in  Eq.  (110): 


1 

2 K 


■—y,8\z)x. 

p 


Thus,  the  second  cross-term  given  in  Eq.  (118)  indeed  generates  the  term  in  Eq.  (110)  associated  with  the 
discontinuity  of  % .  (For  more  on  this,  also  see  Appendix  Q.2). 

6. 1.2. 3  The  Terms  Introduced  by  O (A2  j  Theory  That  Are  Not  Associated 
with  the  Jump  in  the  Energy  Flux 

The  first  term  on  the  right-hand  side  of  Eq.  (118)  represents  the  remaining  contribution  from  the  cross¬ 
terms  of  —X1/2k{)  .  It  is  quite  straightforward  to  demonstrate  that  the  contribution  from  this  term  recovers 
the  implicit  boundary  conditions  on  %  that  would  be  expected  if  Bragg-scale  vorticity  were  not  an  issue. 
To  be  specific,  in  the  absence  of  Bragg-scale  vorticity,  we  would  expect  that  in  the  range -independent 
problem  we  are  considering  here,  the  field  %  would  behave  just  like  the  pressure  field  A.  It  is  not 
difficult  to  show  that  the  contribution  to  the  boundary  condition  generated  by  the  first  term  in  Eq.  (118) 
would,  if  taken  alone,  indeed  cause  the  boundary  condition  on  V2^  to  mimic  the  implicit  boundary 
condition  on  V2  A . 

Now  that  we  have  a  handle  on  what  is  basically  going  on,  Section  6.1.3  will  outline  the  underlying 
physical  processes  responsible  for  all  this  mathematical  subtlety.  After  this  is  accomplished,  Section 
6.1.3  will  put  all  of  this  on  a  firmer  foundation. 


Using  the  F olds -Wouthus  sen  Transformation 
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6.1.3  The  Physical  Significance 


The  physical  significance  of  our  findings  is  that  the  FW  transformation  generates  a  one-way  stepping 
algorithm  that  incorporates  bounded  back  and  forth  motion.  In  the  case  of  the  tilt-induced  smearing 
associated  with  the  new  FW  terms,  the  transformation  refines  out  the  average  influence  of  virtual 
backscatter.  When  a  density  jump  is  introduced,  the  formalism  contends  with  something  new  not  found 

in  the  quantum  problem.  The  velocity  of  the  oscillating  fluid  is  given  by  VA/ p  .  Since  the  pressure  A  is 
continuous  along  the  interface,  so  is  (1  —  hh)-  A,  the  component  of  VA  tangent  to  the  interface  ( h  is  as 


always  the  normal  to  the  interface), 
given  by 


Note  that  the  component  of  the  velocity  tangent  to  the  interface  is 


e~i,m  VA 

Vangent  =^(1  ~  ™) - • 

-1(0  p 


The  period  of  oscillation  T  =  7.71  j CO  is  the  same  everywhere.  Therefore,  when  p  is  discontinuous  on  the 
interface,  the  component  of  the  velocity  tangent  to  the  interface  must  also  be  discontinuous.  The  material 
will  move  more  rapidly  on  the  side  of  the  interface  with  the  lower  density.  As  the  material  slips  along  the 
interface,  the  material  is  displaced  further  on  the  side  with  the  lower  density*.  On  a  wavelength  scale,  the 
fluid  thus  picks  up  an  oscillating  twist,  which  is  called  here  Bragg-scale  vorticity  (BSV).  This  vorticity  is 
not  associated  with  a  circulation;  it  is  just  a  bounded  twisting  oscillation.  (However,  the  variability  in  p 
is  enough  to  ensure  that  curl  of  the  velocity  vector  is  non-zero,  and  so  this  is  indeed  a  bona  fide  vorticity.) 
Since  the  energy  flux  S  is  proportional  to  Im 

its  tangential  component  along  the  interface.  This  might  lead  us  to  expect  that  % ,  a  carrier  of  tangential 
or  near-tangential  flux  along  the  quasi-planar  interface,  jumps  as  well. 

The  FW  transformation  takes  both  these  bounded  oscillations  (i.e.,  virtual  backscatter  and  Bragg-scale 
vorticity),  refines  out  the  average  influence,  and  incorporates  them  in  the  forward-stepping  equation.  In 
particular,  the  fact  that  the  new  formalism  allows  us  to  bring  the  higher-order  PE  right  up  to  the  interface 
is  closely  tied  to  the  fact  that  it  incorporates  Bragg-scale  vorticity.  At  lowest  order,  the  formalism  forces 
the  vorticity  to  be  zero,  but  then  it  builds  the  vorticity  order  by  order,  intimately  coupling  it  to  the  higher- 
order  transverse  derivatives.  Boundary  conditions  on  the  carrier  of  flux  %  (and  on  its  lower  order 
derivatives)  migrate  up  to  the  boundary  conditions  on  higher-order  derivatives,  where  they  “launch”  the 
functions  in  a  way  that  appropriately  simulates  the  full-wave  boundary  conditions. 

6.1.4  A  Systematic  Formal  Discussion 

6.1.4.1  The  O  (/l2  j  Boundary  Conditions  Associated  with  a  Sound  Speed  and  Density  Jump 


H'H 


P 


Bragg-scale  vorticity  also  forces  a  jump  in 


Section  6.1.2  opened  a  window  into  the  mechanism  used  by  the  PE  to  address  the  discontinuities 
associated  with  a  density  jump.  Then,  Section  6.1.3  provided  an  insight  into  the  physical  issues  involved, 
and  an  argument  outlining  why  the  FW  procedure  addresses  this  physical  process  in  a  way  that  is  in 
significant  respects  similar  to  the  way  it  handles  the  classical  equivalent  of  the  “vacuum  polarization” 
effect.  Now,  let  us  pull  all  of  this  together  into  a  systematic  formal  discussion. 


fff  Note  that  the  relative  displacement  becomes  greater  as  the  density  jump  increases  or  the  frequency  co  decreases. 
In  this  sense,  the  effect  is  similar  to  torsion  in  elastic  solids. 
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Begin  with  the  O  (/l2 )  PE  generated  by  the  FW  transformation  (this  is  obtained  by  truncating  Eq.  (92) 
of  Section  5.3.1  or  in  this  case,  equivalently,  Eq.  (90)  of  Section  5.1): 


-i*X-k 

1  -I  Ko 

ox 


2  5 


,  „  A  X 

1-2  y  + - - 

'  i  r\  i  2 


2k, 


Z- 


(119) 


0  J 


Once  again,  as  we  focus  on  the  interface  itself,  consider  the  one-dimensional  interface  embedded  in  two- 
dimensional  space  (i.e.,  no  y  -dependence  and  d/dz  <->  Vr  ).  The  interface  is  now  once  again  allowed  to 

be  range -dependent  (e.g.,  z  =  f  (x)).  In  Appendix  Section  Q.l,  it  is  shown  that  this  Hamiltonian 
produces  the  boundary  conditions 

2kk\_^I^  tZi  _  ^ tZii~\  +  tZi  +  k0flnV  tZu  —  0 

2^[(l  Yi  )VtZ,  —  ^ tZh  ]+^0  (Mil  Yl)  Z  —  0  (1°0) 

77  ^tZi  —~p^tZu 
Zi  —  Zu  ■ 

As  always,  the  reference  density  is  on  Region  II  and  the  reference  compressibility  is  on  Region  I.  (Thus, 
Y i  —  ( Pn  —  Pi )/ ftp  a )  >  Pu  —  —{Kn  —  Kj  ft  ft  Kj )  and  y u  —  LI  f  —  0 .  We  also  have  OCf  —  \  ptJ  p\f  — 
pn  / Pj  .  For  a  boundary  between  the  water  column  and  the  sea  bottom.  Region  I  would  typically  be  the 
water,  and  Region  II  would  be  the  bottom.)  Note  that  we  do  not  include  the  0^f2,f^j  (and  higher 

order)  “vacuum  polarization”  terms  at  this  stage,  since  here  we  are  considering  them  to  be  third  order  (and 
higher)  with  each  downrange  derivative  of  A  adding  an  order.  Equation  (120)  is  the  most  important 
result  in  this  entire  endeavor. 


Let  us  take  a  quick  look  at  how  the  result  (Eq.  (120))  comes  about.  We  instantly  see  that  the  cross¬ 
terms  in  A"  will  in  principle  generate  8  -  and  8'  -functions,  but  not  higher-order  derivatives  of  the  8  - 
function.  Therefore,  taking  two  and  three  improper  integrations  followed  by  a  proper  integration  over  an 
infinitesimal  transverse  interval  straddling  the  interface,  we  immediately  obtain  the  above  continuity 
conditions  on  (1 / p)-  ^ TZ  =  {ftp) '  ^Z  ftp  and  %,  respectively.  Take  one  improper  integration  and  then 

integrate  over  the  infinitesimal  interval  to  obtain  the  boundary  condition  for  V2^  (  =  d2 Z /dz2 ).  Take 
one  integration  over  the  infinitesimal  interval  to  obtain  the  boundary  condition  on  V2  % .  The  leading 
order  derivative  VT  ft/ft)  Vj-  ft/ft)  ^ TZ  as  we^  as  the  cross-terms  Vy.  {/ft  ^,PZ  and  V7  (ft)Vryy 
contribute  to  the  last  two  results. 

6. 1.4.2  Relating  the  PE  Boundary  Conditions  to  the  Full-wave  Result 

Looking  at  the  boundary  conditions  on  X  and  Vr/Jf ,  one  might  conclude  that  X  simply  mimics  the 
full  pressure  field  A  in  our  PE  formalism.  This,  of  course,  cannot  be,  because  we  know  from  our  full- 
wave  result  that  A  should  be  continuous,  while  Bragg-scale  vorticity  should  produce  a  discontinuity  in 
X  ■  As  illustrated  in  Section  6.1.2,  the  higher-order  boundary  conditions  begin  to  correct  this  situation  by 
introducing  Bragg-scale  vorticity.  Let  us  examine  this  a  little  more  closely.  The  first  two  boundary 
conditions  in  Eq.  (120)  above  can  be  rewritten  in  the  form 


Using  the  Foldy-Wouthuysen  Transformation 
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=  0. 


(121) 


Here  the  V\x  and  the  V^-X  boundary  conditions  are  presented  in  a  way  to  illustrate  which  part  is 
needed  to  reproduce  the  full  wave  behavior,  and  which  part  comes  from  “extra”  terms  that  reproduce 
Bragg-scale  vorticity.  As  demonstrated  in  Appendix  Subsection  Q.2,  boundary  conditions  containing 
only  the  terms  outside  the  boxes  are  those  that  would  hold  if  X  truly  mimicked  A .  The  terms  in  the 

boxes  come  from  the  S-  and  cT -functions  formally  generated  by  the  cross-term  Vr  (/^)VrJX  and 

from  the  part  of  Vr  (j/p)  ^TUX  that  is  not  needed  to  generate  the  portion  of  the  boundary  conditions  on 

VjX  and  Vj  X  that  mimic  the  (implicit)  boundary  conditions  on  V\A  and  Vy  A  . 


If  we  take,  for  example,  the  S'  -function  responsible  to  first  order  for  the  extra  part  of  the 
V/j  boundary  condition:  (—1/2 k0) a/;S' [z  —  f  (x))  ,  and  insert  it  by  hand  into  the  first-order 
Hamiltonian,  the  required  first-order  j  ump  in  the  carrier  of  downrange  flux  X  is  indeed  generated  (for 
details,  recall  Section  6.1.2  or  see  the  last  paragraph  of  Appendix  Q.2).  However,  note  when  this  same 
cross-term  remains  in  0(  A2 )  theory  (where  it  truly  belongs),  this  S'  -function  modifies  the  VjX 
boundary  condition  rather  than  the  X  boundary  condition.  Thus,  as  predicted  at  the  end  of  Section  6.1.3, 
the  boundary  condition  on  X  has  indeed  migrated  to  become  a  boundary  condition  on  V2TX  ■  Thus,  we 
once  again  encounter  an  interesting  example  of  the  topic  of  migrating  boundary  conditions  first  raised  in 
Section  4.2.2SEE.  This  reminds  us  that  the  following  result  is  very  widely  applicable:  Quite  generally,  we 
must  take  seriously  the  notion  that  a  boundary  condition  is  really  a  S  -function-type  term,  and  that  the 
effects  associated  with  such  a  term  will  migrate  to  boundary  conditions  associated  with  derivatives  of 
different  order  as  the  leading  order  of  the  differential  equation  is  changed. 

This  then  tells  us  something  about  what  the  PE  derived  using  the  FW  transformation  is  doing  at  an 
interface  where  the  density  jumps.  Although  higher-order  theory  introduces  the  effects  of  Bragg-scale 

vorticity,  the  boundary  conditions  will  still  force  X  and  (1//?)  Vrx[=  (1//?)  to  be 

continuous.  The  step  is  smoothed  as  the  FW  transformation  builds  it  as  a  distribution.  (Recall  that  in 
distribution  theory,  the  step  function  is  built  up  as  a  sequence  of  continuous  functions  that  progressively 
approaches  the  profile  of  a  step.  A  sequence  of  arctangent  functions  is  typically  employed  in  this  manner.) 
The  boundary  conditions  on  the  higher-order  derivatives  launch  the  field  at  the  interface  in  such  a  way 
that  a  step  function  is  built  up.  Effectively,  the  interface  is  allowed  to  expand  to  fill  the  space  allowed  by 
the  uncertainty  principle,  and  the  bare  interface  is  buffered  (or  “fuzzed  out”).  This  type  of  smoothing 
imposed  by  a  one-way  (i.e.,  non-relativistic)  theory  has  a  precedent.  For  example,  [99  (especially  the  first 
full  paragraph  of  p.  948)]:  A  point  charge  in  Dirac  theory  becomes  in  nonrelativistic  theory  a  distribution 
of  charge  and  current  extending  over  a  domain  of  linear  dimensions  hj  me  .  The  Darwin  term  and  spin- 


sgs  It  is  worthwhile  at  this  point  to  remind  ourselves  of  examples  of  boundary  condition  migration  that  we  have 
previously  encountered.  Recall  from  the  discussion  in  Section  4.2.2  that,  similarly,  the  contact  potential  associated 
with  cusp-induced  Lamb  shift  really  corresponds  to  boundary  conditions  on  the  third  derivative  of  the  wave 
function,  but  when  it  is  put  into  the  lowest  order  PE  (or  Schrodinger  equation),  it  nominally  corresponds  to  a  jump  in 
the  first  derivative.  The  classical  Lamb  shift  generated  by  an  interface  (e.g.,  for  a  fluid,  where  the  sound  speed 
and/or  the  density  jump(s))  is  ultimately  to  be  associated  with  the  Helmholtz-equation  boundary  conditions  on 
VjX  ,  but  in  0(2)  theory  it  generates  a  jump  in  %  . 
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orbit  interaction  are  consequences  of  this  “fuzzing  out”  or  buffering  of  the  bare  particle.  (In  quantum 
mechanics,  this  is  also  sometimes  known  as  “dressing”  the  singularity.) 

However,  this  raises  an  interesting  issue.  As  just  noted,  when  we  keep  on  increasing  the  order  of  our 
PE  by  repeatedly  applying  the  FW  procedure,  the  behavior  of  the  field  %  should  approach  that  of  a  step 
function  in  much  the  same  way  that  a  series  of  arctangent  functions  approaches  the  Heaviside  step 
function  in  generalized  function  theory  (i.e.,  distribution  theory).  This  implies  that  %  becomes  rapidly 
varying  near  the  interface,  and  consequently  that  the  lower  order  derivatives  of  the  wave  function  should 
become  large.  For  example,  at  some  point  in  the  process  of  moving  to  ever-higher  order,  V\x  anc*  so 

also  the  expansion  parameter  A  would  have  to  become  quite  large.  Nevertheless,  this  type  of 
“convergence  in  the  generalized  sense”  to  a  step  function  can  still  be  reconciled  with  the  notion  that  both 
the  generator  of  the  FW  transformation  and  the  “left  behind”  off-diagonal  elements  are  getting  smaller  as 

we  move  to  higher  and  higher  order.  The  reason  is  as  follows.  The  expansion  parameters  A"  and 
?i"Y>  are  always  small  when  they  are  actually  being  used  to  generate  a  FW  transformation1*'1.  The 
higher  orders  left  in  the  off-diagonal  (odd)  operator  are  small  as  well.  After  the  diagonalizing  FW 
procedure  “has  passed  through”  a  given  order  n  ,  then  A "  and  A"'y"  now  operating  on  a  new  higher- 
order  wave  function  %  may  become  large,  but  by  this  time  they  no  longer  function  as  expansion 
parameters. 

6.1.4.3  The  Role  of  Noneommutivity  in  Boundary  Condition  Migration 

It  is  very  interesting  to  note  that  the  key  structural  condition  that  makes  all  of  this  possible  is  the  fact 
that  A  no  longer  commutes  with  the  Hamiltonian: 

[A,H]*  0. 

When  this  is  the  case,  there  is  nothing  to  tie  the  value,  of  say,  A%  to  A" %  or  of  A%  from  an  earlier 
iteration  to  A%  from  a  later  iteration.  Thus,  A%  can  be  small  in  low-order  theory,  and  then  later  become 
large  at  higher  orders.  Furthermore,  at  the  higher  order  where  A%  is  big,  our  formalism  still  allows  A" % 
to  be  small. 

In  fact,  the  noneommutivity  of  A  and  H  allows  orders  of  the  PE  to  change  the  behavior  of  %  in  a 
number  of  fundamental  ways.  For  example,  it  is  a  necessary  condition  for  the  boundary  conditions  to 
migrate.  Indeed,  in  the  case  of  the  variable  density  acoustic  PE,  we  have  just  found  that  boundary 
conditions  are  allowed  to  migrate  even  in  the  absence  of  range  dependence.  This  follows  from  the  fact 
that  [A,  /]^0  and  so  [A,  H]^0.  Similarly,  when  we  add  the  new  “vacuum  polarization”  terms 
generated  by  the  FW  transformation  (and  associated  with  range  dependence),  here  also  we  find  that  these 
new  terms  end  the  commutivity  of  A  with  H  since  \  A,  A  \  A  0 . 


hhh  I.e.,  when  they  are  the  actual  expansion  parameters  embedded  within  the  generator  of  the  FW  transformation  S 
—  see  Section  3.1  and  Eq.  (12);  and  note  that  S  is  proportional  to  O  ,  the  current  odd  operator  in  the  Hamiltonian 
left  after  some  given  number  of  iterations  of  the  FW  procedure,  and  that  O  contains  operators  of  the  form  A"  and 
Amy"  "‘  as  well  as  operators  with  higher  overall  powers  of  A-y .  Note  that  for  the  moment,  we  are  temporarily 
reverting  to  the  tilde  ( ~  )  notation  to  remind  ourselves  that  the  current  wavefunction  %  and  the  currently  remaining 
odd  operator  O  have  emerged  from  repeated  applications  of  the  FW  transformation,  and  not  just  from  the  original 
ansatz. 
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On  the  other  hand,  boundary  conditions  cannot  migrate  in  the  constant-density  range-independent 
problem  where  the  sole  expansion  parameter  A  commutes  with  the  Hamiltonian  H.  In  this  case,  higher 
orders  only  add  new  explicit  boundary  conditions,  but  they  cannot  change  the  boundary  conditions  at 
lower  orders.  Let  us  examine  why  this  is  so.  Since  the  commutator  [/l,//]  is  now  zero  and  there  is  no 

range  dependence,  we  could  decompose  the  field  %  into  eigenvectors  %■  and  replace  the  dimensionless 
operator  A/k0  with  the  eigenvalues  £j  and  then  (A/k^f  with  e" Then  if  A/k(]  and  £■  are  small,  so 

are  (A/A:0)"  and  s” .  Thus,  we  can  no  longer  have  the  behavior  where  high  orders  of  A/ k{]  are  small 
while  low  orders  are  large.  Furthermore,  8  -functions  and  other  singularities  within  each  term  of  the 
form  {Ajk^'  X j  —  £,]X i  must  all  cancel  one  another  internally11'.  It  follows  that  increasing  the  order  of 

the  PE  can  generate  new  boundary  conditions,  but  it  cannot  modify  boundary  conditions  previously 
obtained  at  a  lower  order.  Recall  that  in  Appendix  K.l  .1.2,  this  insight  was  used  to  verify  that  8  -function 

bifurcation  employed  in  the  O^/l3)  PE  for  a  sound  speed  jump  (and,  of  course,  constant  density) 

correctly  reproduced  the  implicit  boundary  conditions  on  higher-order  derivatives  of  the  wave  function. 
Also  note  that  since  boundary  conditions  obtained  at  low  orders  remain  fixed,  and  higher  orders  only 
serve  to  supplement  these  conditions  with  new  boundary  conditions  affecting  higher-order  derivatives,  it 
follows  that  boundary  conditions  can  no  longer  migrate  in  this  case.  In  this  sense,  the  boundary 
conditions  generated  by  each  order  in  A  decouple. 

For  weak  range  dependence,  we  would  in  general  not  expect  large  deviations  from  the  conclusions 
based  on  the  assumption  that  there  is  no  range  dependence.  However,  recall  that,  as  noted  above,  for  a 
sound  speed  jump  where  the  density  is  the  same  everywhere  {8p  —  0),  the  only  thing  that  allows 
boundary  conditions  to  migrate  is  the  fact  that  range  dependence  adds  new  terms  via  the  FW 
transformation,  and  these  force  the  crucial  commutator  to  become  non-zero:  [A,  //  ]  ^  0  .  Thus,  the  new 
“vacuum  polarization”  terms  have  introduced  a  fundamental  change  in  the  nature  of  the  Hamiltonian  H . 

6.1.5  Tying  Together  “Vacuum  Polarization”  with  Bragg-scale  Vorticity 
and  a  Look  at  Other  Future  Extensions  of  the  Results 

6.1.5.1  Understanding  the  O  ( A )  Terms  Introduced  by  the  Foldy-Wouthuysen  Transformation 

Only  now  that  we  have  established  the  predominant  factor  involved  in  the  boundary  conditions,  Bragg- 
scale  vorticity,  and  the  related  0(8p)  jump  in  the  carrier  of  flux  X-  are  we  truly  ready  to  go  back  and 
generalize  to  the  variable  density  (i.e.,  8p  V  0 )  problem  the  discussion  in  Section  4.2.1,  and  establish  the 
meaning  of  the  O (/,/2)  “vacuum  polarization”  terms  contributed  to  the  deterministic  problem  by  FW 
transformation. 


111  By  switching  to  the  dimensionless  operator  A/k0  ,  we  ensure  that  the  corresponding  eigenvalues  £j  are  also 
dimensionless.  This  allows  us  to  unambiguously  label  the  eigenvalues  £.  as  small. 

111  This  is  ultimately  the  reason  why  contact  potentials  are  allowed  in  the  context  of  the  PE.  Similarly,  it  is  also  why 
we  can  have  a  l/r  potential  in  the  atomic  Schrodinger  equation  and  still  have  the  overall  operator  —V2r /im  +  V 
remain  small. 
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The  tilt-induced  terms  proportional  to  j  (  Sk/k„  —  s/u  j  play  a  role  that  is  essentially  the  same  as  that 
played  by  the  /  //-term  in  the  constant  density  discussion  of  Section  4.2.1;  they  are  related  to  the 
boundary  conditions  for  V2  A  (where  A  is  as  always  the  pressure  field). 

The  “new”  term  proportional  to  Sp  ■  j  ■  S' (z  —  /)  is  related  to  the  jump  in  X  -  which  in  turn  is 
related  to  the  jump  in  downrange  flux,  recalling  that  X  is  roughly  speaking  a  carrier  of  downrange  flux  in 
the  sense  that  to  within  a  integration  by  parts,  \x\  is  proportional  to  4  I  m  ( A  ■  dA/dx'j ,  which  is  in  turn 
also  proportional  to  the  downrange  flux.  The  effects  of  this  term  can  already  be  seen  in  first-order  theory. 
On  the  other  hand,  the  other  “new”  term  proportional  to  8p  ■  / 2  ■  8"  (  z  —  /  )  only  kicks  in  at  O  (A2  j ,  and 

this  one  is  related  to  corrections  to  the  (1 / p)  ■  dA/dz  boundary  condition. 

As  before  in  Section  4.2.1,  the  terms  proportional  to  the  curvature  /  lead  to  local  curvature-induced 
boundary  phenomena  such  as  boundary  waves  (cf.  polaritons  in  electromagnetic  waves). 

6. 1.5.2  A  Term  in  the  Full-wave  Problem  that  is  Implicit  in  the  PE  Formulation 

The  O  (  /  ■  Sp  j  contribution  to  the  downrange  flux  that  exists  in  the  full- wave  problem  apparently 

does  not  enter  the  PE  via  an  explicit  term  contributed  by  the  FW  transformation.  Instead,  the  0(  /  ■  Sp^ 
contribution  intrinsically  migrates  up  to  the  highest  order  derivative  of  %  (e.g.,  in  quasi-first-order  theory 
of  the  sort  considered  throughout  Section  3,  the  V2^  term),  where  it  is  introduced  via  the  implicit 
boundary  condition  on  H X  ■  Note  that  the  tendency  of  the  O  (  /  ■  8p^  term  to  migrate  to  higher  order  is 

not  very  different  from  the  behavior  of  the  O  {Sp2)  terms  introduced  by  Bragg-scale  vorticity.  Both  cases 
reflect  the  tendency  of  the  PE  to  force  the  auxiliary  field  %  to  remain  continuous. 

6.1.5.3  Future  Extensions  Beyond  ()(/l2  j  or  0{X/X)  Theories 

As  before  in  the  constant  density  case  of  Section  4.2.1,  we  have  just  now  seen  that  also  when  the 
density  jumps  (i.e.,  8p  A  0 ),  we  will  need  at  least  O  [A2  ,X)  theory  to  fully  understand  all  the  “vacuum 

polarization”  (i.e.,  °c  /,  /  )  terms.  In  fact,  for  the  variable  density  case  (i.e.,  8p  A  0 ),  we  even  have  a 
term  proportional  to  8"{z  —  f)  that  does  not  contribute  at  all  to  quasi-first-order  (i.e.,  O  |/L  Xj )  theory. 

Therefore,  future  0(A2,  X  j  or  even  O^A3,X\  theories  in  both  their  deterministic  and  stochastic 
versions  will  be  interesting  and  worthwhile,  but  we  will  leave  this  synthesis  to  future  work.  Furthermore, 
we  will  even  defer  for  future  work  a  consideration  of  the  stochastic  version  of  the  basic  O  (/l2 )  theory  of 
Bragg-scale  vorticity  (which  would  of  course  include  diffuse  (Bragg)  scattering).  For  the  near  term,  we 
will  concentrate  on  the  acoustic  Famb  shift  based  on  stochastic  quasi-first-order  O  (A,X)  theory  (minus 
diffuse  scattering)  separately  and  in  isolation  from  the  Bragg-scale  vorticity  based  on  deterministic 
second-order  O  (^2)  theory  (which  implicitly  includes  diffuse  scattering). 
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Furthermore,  recall  that  we  have  all  along  assumed  that  p  does  not  vary  in  the  half  spaces.  Thus,  we 
have  not  yet  allowed  for  the  possibility  that  there  may  be  steps  in  WTp  and  in  V2Tp  at  the  interface. 
Such  singularities  will  lead  to  new  Lamb -shift-type  stochastic  effects  on  the  coherent  field  even  in  basic 
first-order  (i.e.,  O  ( A) )  theory.  In  the  future,  this  possibility  will  need  to  be  examined  as  well.  Finally, 
all  these  versions  of  the  theory  will  have  to  be  thoroughly  compared  to  data. 

6.1.6  The  Distinction  Between  Bragg-sccile  Vorticity  and  the  Lamb  Shift 

It  is  appropriate  to  close  our  examination  of  Bragg-scale  vorticity  by  clearly  reviewing  the  distinctions 
between  this  effect  and  the  classical  Lamb  shift.  The  latter  already  occurs  for  first-order  (i.e.,  0(  A) ) 
theory,  or  if  we  want  to  include  the  classical  equivalent  of  the  vacuum  polarization  correction,  quasi-first- 
order  (i.e.,  O^A,X})  theory.  It  modifies  the  coherent  field  near  stochastic  quasi-planar  surfaces,  where 

the  sound  speed,  sound  speed  gradient,  and/or  density  jump.  The  effect  takes  the  form  of  contact 
potentials,  which  modify  the  boundary  conditions,  and  there  is  a  close  analogy  to  the  Lamb  shift  problem 
of  atomic  physics.  The  validity  of  the  new  physics  predicted  by  the  FW  transformation  is  confirmed  by 
the  relationship  between  the  new  FW  term  for  a  scalar  field  and  the  phenomenon  of  vacuum  polarization 
known  from  atomic  physics.  It  is  relatively  easy  to  incorporate  the  acoustic  Lamb  shift  into  existing 
codes,  and  so  this  has  been  the  first  (and  so  far  only)  aspect  of  the  new  theory  to  be  numerically 
implemented  (see  Section  5.4). 

Bragg-scale  vorticity  occurs  at  a  density  jump,  and  it  is  associated  with  second-order  (i.e.,  0(A2 ) ) 

and  higher  theory.  It  applies  most  dramatically  to  diffuse  (Bragg)  scatter  and  to  the  deterministic  problem. 
The  proper  incorporation  of  Bragg-scale  vorticity  involves  a  basic  restructuring  of  existing  PE  codes,  and 
there  is  no  immediate  quantum  analogy.  On  the  other  hand,  Bragg-scale  vorticity  is  of  greater  importance 
than  the  classical  Lamb  shift  for  modeling  acoustic  fields  since  it  addresses  the  problem  that  has 
ultimately  motivated  this  effort:  that  of  properly  applying  the  PE  to  a  tilted  interface  where  the  density 
jumps  (see  Section  2.3  for  a  discussion  of  why  this  is  so  important).  Furthermore,  as  we  see  in  Section  7, 
jumps  in  electric  and  magnetic  permeability  (in  electromagnetic  theory)  and  in  the  second  Lame 
parameter  (in  the  theory  of  elastic  waves)  involve  similar  issues.  The  development  of  new  PE  codes  that 
include  Bragg-scale  vorticity  will  thus  have  a  high  priority  in  the  future.  Flowever,  since  they  will  have  to 
be  developed  from  scratch,  the  development  of  these  codes  will  not  be  pursued  in  the  current  effort,  but 
will  instead  be  deferred  to  follow-on  work. 

6.2  Comparison  to  the  Currently  Popular  Parabolic  Equation  Techniques  of  Tappert  and  Collins 

This  section  uses  the  new  formalism  developed  in  this  report  to  reexamine  the  two  PE  methods  first 
mentioned  in  Section  2.2.  These  are  currently  the  preferred  techniques  for  describing  range-dependent 
interfaces  with  density  jumps.  In  Section  6.2.1,  the  new  formalism  is  used  to  examine  Tappert’s  technique 
of  changing  the  variable  and  smearing  out  the  interface,  and  in  Section  6.2.2,  Collins’  stair  step  technique 
is  examined  in  light  of  the  new  formalism.  Section  6.2.3  summarizes  the  relative  merits  of  these 
approaches  and  serves  to  guide  the  modeler  toward  the  optimal  method  in  a  given  context. 

6. 2. 1  Tappert ’s  Change  of  Variable  Formalism 

The  change  of  variable  (COV)  method  by  Tappert  briefly  described  in  Section  2.2  (see  also  Ref.  15, 
pp.  262-264)  rests  on  the  observation  that  in  the  full-wave  Eq.  (1),  one  can  change  the  wavefunction  to 

u  =  A j ^Jp  (i.e.,  make  a  change  of  variables  from  A  — »  A  =  ^[p  ■  u  ).  This  leads  to  a  standard 
Helmholtz  equation  for  u  with  a  new  effective  index  of  refraction.  Now,  one  can  proceed  as  before  to 
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create  a  new  PE.  The  discussion  surrounding  Eq.  (19)  demonstrates  that  this,  like  any  other  PE,  will 
actually  propagate  an  auxiliary  wave  % : 


X  x  V#  •  M 


,d_  _A_ 

dx 


X  is  immediately  recognizable  as  a  carrier  of  flux  (if  this  is  not  clear,  see  Eq.  (97)  and  the  related 
discussion).  The  Hamiltonian  that  propagates  this  field  is  a  Hermitian  (i.e.,  magnitude -conserving  or 
“stable”)  Hamiltonian,  and  so  \x\  and  consequently  the  downrange  flux  are  conserved  quantities,  and  it 
follows  so  is  the  energy. 

The  result  obtained  using  the  COV  technique  anticipates  that  obtained  using  the  FW  transformation, 
but  with  a  key  difference  that  becomes  important  in  the  interface  limit.  For  the  substitution  A  — »  <Jp  ■  u  , 

the  expansion  parameter  //effective  =  (l  -  «L2[iccllvc  )/2  picks  up  8-functions  in  the  interface  limit.  The  COV 
formalism  therefore  becomes  too  singular  in  this  limit.  The  FW  approach  effectively  differs  from  the 
COV  approach  in  that  it  takes  the  factor  of  1 / -Jp  in  the  auxiliary  wave  function  u  =  A/ -Jp  and 
expands  it  in  2/  =  (/?0  —  p)jp0  =  —  Ap/p0  .  This  slows  down  convergence  enough  so  that  8  -function 

bifurcation  can  be  used  to  evaluate  those  previously  troublesome  terms  that  involve  products  of 
distributions  at  interfaces.  More  specifically,  the  Helmholtz  equation  generated  by  the  COV  substitution 
can  be  properly  interpreted  even  at  an  interface,  provided  that  one  properly  expands  in  2y=—Ap/p0 

and  then  uses  8  -function  bifurcation  to  evaluate  the  result  (Appendix  K.2.1.2  shows  how  things  begin  to 
work  out).  Proceeding  along  these  lines,  a  PE  similar  to  that  generated  by  the  FW  transformation  could 
then  in  principle  be  obtained  from  the  Helmoltz  equation  containing  8  -function  potentials.  Such  a  PE 
would  effectively  buffer  the  density  jumpkkk.  In  summary,  the  COV  substitution  becomes  too  singular  in 
the  interface  limit,  while  the  FW  procedure  smoothes  out  the  discontinuity  in  the  density  p  by 

interjecting  an  extra  expansion  in  Apj pQ .  In  principle,  the  COV  technique  could  be  applied  to  an 
interface  by  introducing  this  expansion  by  hand. 

On  the  other  hand,  if  the  COV  formalism  is  to  be  applied  directly  to  an  interface  without  such 
additional  modifications  that  bring  it  into  line  with  the  FW  formalism,  then  the  interface  must  be 
artificially  smoothed.  Tappert  originally  adopted  this  ad  hoc  approach  and  it  has  been  used  since.  Since 
the  formalism  generated  by  the  FW  transformation  effectively  smoothes  the  surface  in  a  manner  precisely 
determined  by  field  theory,  it  is  clearly  the  proper  way  to  handle  a  hue  interface.  On  the  other  hand,  there 
are  many  instances  in  nature  where  the  transition  from  one  medium  to  another  is  very  gradual  on  the  order 
of  a  wavelength,  and  in  such  instances,  there  is  no  advantage  to  slowing  down  convergence  by 
interjecting  an  exha  expansion.  The  substitution  proposed  and  developed  by  Tappert  would  then  work 
very  well,  and  indeed  it  would  likely  be  the  better  approach. 

Finally,  note  that  Tappert’s  COV  technique  shares  an  important  physical  insight  with  the  result 
predicted  by  the  FW  transformation.  A  central  result  of  the  procedure  based  on  the  FW  transformation  is 
that  the  density  discontinuity  is  buffered.  The  COV  technique  reconciles  the  PE  with  a  density  jump  by 
smoothing  out  the  interface.  At  least  at  low  orders,  the  two  pictures  are  quite  similar,  implying  that  the 


^  I.e.,  as  discussed  in  Section  6.1.4,  the  PE  would  replace  the  (discontinuous)  step  in  the  downrange  flux  with  a 
distribution  (i.e.,  a  sequence  of  continuous  functions)  that  only  approaches  the  step  function  (at  least  to  within  the 
tolerance  of  the  uncertainty  principle)  in  the  limit  as  the  PE  goes  to  °°  order.  This  is  similar  to  the  effect  that  would 
be  produced  by  “fuzzing  out”  the  density  jump. 
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COV  technique  was  based  on  a  very  profound  physical  insight:  that  the  key  to  reconciling  the  PE  with  the 
issues  associated  with  a  density  jump  lies  in  buffering  the  interface.  The  new  technique  generated  by  the 
FW  transformation  represents  an  advance  since  it  automatically  introduces  this  smoothing  in  a  manner 
dictated  by  a  precise  physical  theory,  rather  than  by  fiat. 

6.2.2  Collins  ’  Stair  Step  PE 

6.2.2.1  The  Stair  Step  PE  Exploits  Several  Attributes  of  the  Range-independent  Problem 

The  current  state  of  the  art  PE  (Stair  Step  PE)  developed  primarily  by  Collins  and  Westwood  [18] 
builds  on  a  very  clean  solution  to  the  truly  range-independent  problem.  As  long  as  an  interface  is 
completely  flat  (i.e.,  range -independent),  designing  an  appropriate  PE  is  perfectly  straightforward  —  even 
when  the  density  jumps  ( 8p  A  0 ).  In  the  range -independent  environment,  this  PE  can  be  written  by 
taking  the  square  root  of  the  wave  equation: 


where 


~p^t  +J(2 


A  =  0: 


-i^  =  ^  +  pVTjVT-k^l-n2)-A, 
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!  +  #•  A 


A  = 
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2  kn 
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and  A  is  the  pressure  field.  Note  that  this  operator  A  is  not  quite  the  same  as  the  one  we  have  been 
using  in  the  context  of  the  FW  procedure  —  in  fact,  it  is  not  strictly  speaking  Hermitian.  However, 
energy  is  nevertheless  conserved  for  this  PE.  We  can  see  this  as  follows:  first,  note  that  the  operator 

4h  commutes  with  H  and  d/dx,  and  so  in  this  range -independent  scenario  we  can  use  the  same 
Hamiltonian  to  propagate  the  auxiliary  field  defined  by 


v  =  s[H  ■  A  =  y  J-id/dx  ■  A  . 


At  all  finite  orders  of  the  expansion  in  A ,  this  Hamiltonian  is  Hermitian  with  respect  to  the  metric 


f  dRT 

J  P 


(recall  that  clRT  is  a  differential  element  in  transverse  space:  dR,  =  clyclz ).  Thus, 
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Thus,  the  total  downrange  flux  (which  is  to  within  one  integration  by  parts  proportional  to  J  -y-|v|  ) ,  is 
conserved,  as  is  the  energy. 

In  a  typical  implementation  of  the  stair  step  PE,  one  only  uses  the  lowest  order  boundary  conditions: 
continuity  of  A  and  of  (1 /  p)  ( dA/dz )  -  One  could  in  principle  use  higher-order  boundary  conditions  by 

explicitly  imposing  the  boundary  conditions  that  are  implicit  in  full-wave  theory.  Such  an  approach 
would  use  the  technique  developed  earlier  in  Section  3.3.2  (i.e.,  force  the  grid  to  fall  on  the  interface,  and 
then  use  the  boundary  conditions  to  generate  the  derivatives  that  go  into  the  Hamiltonian  needed  for 
downrange  stepping).  However,  this  approach  is  not  used  in  the  standard  implementation  of  the  Stair  Step 
PE.  Instead,  as  discussed  in  footnote  kk  and  Ref.  69,  one  instead  places  the  interface  in  between  two  grid 
points,  then  overlaps  the  upper  and  lower  half-spaces  by  extending  each  out  beyond  the  interface  by  one 

extra  grid  point,  and  then  uses  the  two  boundary  conditions  of  0(A)  -theory  to  solve  for  the  field  at  the 

extra  (nonphysical)  points.  The  extra  points  are  used  to  calculate  the  second  transverse  derivative  in  the 
first-order  Hamiltonian,  which  is  the  one  used  to  generate  downrange  stepping  right  at  the  surface. 
Unfortunately,  this  method  does  not  readily  generalize  to  higher-order  theory,  and  so  higher-order  theory 
has  not  typically  been  used  in  conjunction  with  the  stair  step  PE.  In  a  typical  implementation,  one 
converts  from  the  higher-order  PE  to  first-order  theory  in  a  narrow  band  surrounding  the  interface. 

The  Stair  Step  PE  has  one  clear  advantage  over  the  PE  generated  by  the  FW  transformation.  It  exploits 
the  fact  that  when  the  problem  is  truly  range -independent111,  it  is  possible  to  take  the  square  root  of  the 
wave  equation  and  propagate  the  pressure  A  directly.  Now  the  continuity  condition  is  applied  to  A  and 
not  to  the  auxiliary  field  % ,  so  Bragg-scale  vorticity  is  present  from  the  beginning  and  the  Stair  Step 
formalism  does  not  need  to  work  to  generate  it.  Furthermore,  keep  in  mind  that  although  it  has  not  been 
done  in  the  past,  there  is  no  a  priori  reason  why  boundary  conditions  that  are  implicit  in  full-wave  theory 
could  not  be  made  explicit  to  generate  a  higher-order  version  of  the  range-independent  Stair  Step  PE. 
Thus,  Collins’  Stair  Step  PE  is  clearly  the  method  of  choice  when  the  interface  is  completely  range- 
independent. 

6.2.2.2  Range-dependence  in  the  Context  of  the  Stair  Step  PE 

Let  us  now  examine  how  this  picture  changes  when  range-dependence  is  introduced  (i.e.,  when 
the  interface  acquires  tilt  and  curvature).  As  always,  energy  conservation  is  maintained  by  conserving 
the  downrange  flux  Sx.  Conservation  of  the  downrange  flux  Sx  is  achieved  by  conserving 

JS~x^^l/p-(d/dx)  ■  A  \j H/ p  ■  A  .  This  conservation  rule  must  be  explicitly  applied  globally  along 

the  entire  transverse  space  and  not  just  in  the  immediate  vicinity  of  the  interface.  This  procedure  has  the 
same  effect  as  applying  the  endpoint  transformation  to  go  from  the  pressure  field  A  to  the  carrier  of  flux 
X ,  then  applying  the  first  boundary  condition  generated  by  the  FW  theory  (continuity  of  X  )>  and  then 

transforming  back  to  A. 

As  noted  in  Section  2.2,  forcing  downrange -energy-flux  conservation  at  a  vertical  interface  leads  to  a 
discontinuity  of  the  pressure  field,  while  the  boundary  conditions  imposed  along  the  horizontal  interface 
demand  pressure  continuity.  This  leads  to  an  unphysical  discontinuity  in  the  pressure  at  the  corners  of  the 
stair  step,  which  in  turn  spawns  Gibbs  oscillations.  The  Gibbs  oscillations  are  buffered  by  introducing 
waves  that  are  evanescent  in  the  downrange  direction  (i.e.,  kK  is  imaginary).  As  noted  towards  the  end  of 
Appendix  B,  evanescent  waves  cannot  be  propagated  using  the  finite -order  PE  expansions  of  the  sort 

111  An  environment  is  truly  range-independent  if  the  sound  speed  and  density  profiles  are  independent  of  the  range  r 
and  all  interfaces  are  flat  (i.e.,  have  zero  slope). 
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generated  by  the  FW  transformation  since  the  expansion  parameter  k,/k0>  1.  However,  the  exact 

square -root  operator  can  be  inserted  into  the  Hamiltonian  by  hand,  and  this  is  indeed  what  is  done  by 
introducing  complex  Pade  coefficients  into  the  Hamiltonian.  The  evanescent  waves  do  not  harm  energy 
conservation,  because  they  have  no  downrange  flux,  but  they  contribute  to  the  total  field  and  participate  in 
the  boundary  conditions  along  the  horizontal  interface  for  some  distance  behind  a  step  (until  they  decay  to 
insignificance).  For  example,  while  they  restore  continuity  of  the  field  A  at  the  step,  as  in  the  method 
based  on  the  FW  transformation,  the  actual  physical  ( propagating )  waves  remain  discontinuous  near  a 
range-dependent  interface.  In  other  words,  the  propagating  component  of  the  field  —  as  opposed  to  the 
total  field  consisting  of  both  the  propagating  component  and  the  evanescent  waves  —  continues  to  obey 
the  same  first  boundary  condition  as  that  imposed  by  the  FW  procedure.  The  evanescent  waves  also  carry 
vertical  flux  and  contribute  the  second  boundary  condition  (i.e.,  that  on  the  derivative  of  the  field 
dA/dz  )■  Among  other  things,  this  has  the  effect  of  modifying  the  effective  boundary  condition  on  the 
derivative  of  the  downrange  propagating  component  of  the  field.  In  fact,  the  set  of  evanescent  waves  can 
be  chosen  so  that  at  a  step  the  propagating  field  precisely  obeys  the  second  boundary  condition  demanded 
by  FW  theory  as  well  as  the  first  boundary  condition. 

Note  that  once  again  as  with  the  PE  generated  by  the  FW  transformation  and  with  an  implementation 
of  the  change  of  variable  technique  near  an  interface,  the  density  discontinuity  is  effectively  buffered  in 
the  context  of  the  Stair  Step  PE  —  this  time  by  evanescent  waves.  Thus  we  see  that  by  employing 
physical  intuition  and  by  addressing  practical  considerations  imposed  on  them,  both  Tappert  and  Collins 
anticipated  the  key  philosophical  insight  that  emerges  from  the  technique  based  on  the  FW 
transformation. 

6.2.2.3  Comparing  the  Stair  Step  PE  with  that  Generated  by  the  FW  Transformation 

We  have  just  established  that  the  Stair  Step  PE  effectively  goes  over  to  first-order  FW  theory  at  the 
vertical  interface,  and  then  gradually  fades  back  to  the  range-independent  theory  as  the  evanescent  waves 
decay.  Thus,  the  Stair  Step  PE  is  in  essence  a  hybrid  that  combines  lowest-order  FW  theory  with  the 
range-independent  theory.  It  is  ideally  suited  for  cases  where  the  range  dependence  is  modest  (i.e.,  where 
there  is  relatively  little  wavelength-scale  roughness).  Indeed,  as  already  noted  above,  along  a  truly  range- 
independent  (horizontal)  interface,  the  Stair  Step  PE  correctly  generates  jumps  in  the  downrange  flux 
already  at  first  order,  and  so  if  the  interface  is  flat  or  very  nearly  so,  the  Stair  Step  PE  should  be  better 
than  the  one  generated  by  the  FW  procedure™™11.  Whenever  the  interface  is  locally  flat  and  the 
evanescent  waves  have  all  died  out,  the  Stair  Step  method  could  be  improved  even  further  by  imposing 
the  implicit  boundary  conditions  on  the  higher-order  derivatives,  and  then  using  the  procedure  outlined  in 
Section  3.3.2  to  evaluate  higher-order  Hamiltonians. 

For  a  multiscale  rough  surface  (especially  one  that  induces  Bragg  scattering),  the  PE  crosses  the 
interface  very  frequently.  If  the  interface  increments  to  a  new  level  frequently  enough,  then  the 
evanescent  waves  never  die  off,  and  the  Stair  Step  PE  essentially  remains  first-order  of  the  theory 
generated  by  the  FW  transformation™11.  Under  such  a  scenario,  the  theory  generated  by  the  FW 
transformation  has  several  advantages: 

•  It  is  only  necessary  to  perform  the  transformations  from  A  — >  %  and  back  again  from  %  — >  A 
once  at  the  endpoints  rather  than  at  each  step. 


mmm  To  be  specific,  in  this  case  it  would  take  fairly  high  orders  of  the  PE  generated  by  the  FW  transformation  to 
match  the  accuracy  of  the  Stair  Step  PE. 

nnn  Note  that  changing  the  vertical  and  horizontal  step  size  will  not  change  this  analysis.  The  decay  range  of  the 
evanescent  range  is  determined  by  the  size  of  the  vertical  step,  but  not  that  of  the  horizontal  step.  On  the  other  hand, 
if  we  reduce  the  size  of  the  vertical  step,  then  we  have  to  step  more  often  and  generate  evanescent  waves  more  often. 
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•  It  is  not  necessary  to  insert  the  evanescent  waves  by  hand;  their  presence  is  inferred  by  the  nature 
of  the  boundary  conditions  on  % . 

•  The  second  boundary  condition  of  FW  theory  effectively  selects  out  an  optimal  subset  of 
allowable  evanescent  solutions. 

•  In  the  FW  theory,  it  is  possible  to  obtain  higher  orders000.  At  higher  orders,  the  “on-shell” 
unphysical  evanescent  waves  are  replaced  by  “off-shell”  virtual  particles  (“off-shell”  solutions  do 
not  have  to  follow  the  Helmholtz  equation,  just  the  higher-order  PE)PPP.  These  produce  a  field 
that  eventually  agrees  with  the  full  wave  result  to  within  the  tolerance  of  the  uncertainty  principle. 

Note  that  the  first  two  bullets  imply  that  the  approach  based  on  the  FW  transformation  is  more  efficient. 
Furthermore,  taken  as  a  whole,  these  four  advantages  suggest  that  the  theory  based  on  the  FW 
transformation  is  the  best  method  for  modeling  the  scattering  from  multiscale  rough  surfaces  (e.g.,  diffuse 
(Bragg)  scatter). 

A  potentially  useful  hybrid  theory  would  employ  Stair  Step  PE  (Collins’  method)  along  relatively 
smooth  surfaces  until  the  surfaces  become  so  rough  that  steps  in  the  interface  become  frequent;  at  that 
point,  the  algorithm  would  go  over  to  the  theory  based  on  the  FW  transformation.  This  should  be 
developed  in  future  research. 

6.2.3  Overview  of  Available  Methods 

It  is  now  possible  to  examine  the  range  of  validity  of  the  various  approaches  to  modeling  propagation 
in  a  duct  with  rough  quasi-planar  interfaces™.  The  COV  approach  by  Tappert  (Section  6.2.1)  is  best 
when  changes  to  the  density  really  are  gradual  (on  the  scale  of  a  wavelength),  even  in  the  vertical 
direction.  The  Stair  Step  PE  by  Collins  (Section  6.2.2)  is  the  best  theory  to  use  when  the  interface  is 
range  independent  (i.e.,  without  tilt  or  curvature),  and  it  continues  to  be  the  method  of  choice  if  the  range 
dependence  is  modest  (i.e.,  the  interface  is  slowly  varying  on  the  order  of  a  wavelength).  When  quasi- 
planar  rough  surfaces  (with  features  of  a  wavelength  and  less)  are  present,  then  the  theory  based  on  the 
FW  transformation  is  the  best  one  to  use.  This  is  also  a  good  general  theory  that  can  credibly  be  applied 
to  the  previous  two  scenarios  as  well  as  to  the  rough-interface  problem  for  which  it  was  designed.  Since 
the  FW  approach  is  systematic,  it  includes  higher  orders,  and  so  it  can  be  adjusted  to  deal  with  larger 
density  jumps  than  can  alternatives  that  do  not  have  this  capability. 

For  completeness,  it  is  also  appropriate  that  we  briefly  note  two  other  methods  used  to  model  ducted 
acoustic  propagation:  double  sweep  methods  and  the  method  of  coupled  modes.  These  are  used,  for 
example,  if  vertical  interfaces  are  present  and/or  there  is  an  interest  in  calculating  backscatter.  Coupled 
modes  are  also  particularly  useful  in  considering  scenarios  such  as  the  fluid  elastic  interface  where  the 


000  Some  of  the  evanescent  waves  are  not  physical  (since,  for  example,  they  do  not  appear  in  the  full-wave  solution 
where  a  plane  wave  hits  a  tilted  interface  without  curvature).  The  correct  higher-order  theory  should  make  the  non¬ 
physical  evanescent  waves  disappear  as  the  higher  orders  are  added.  Thus,  nothing  would  be  gained,  for  example, 
by  simply  imposing  the  higher-order  implicit  boundary  conditions  for  the  full  wave  on  the  combined  evanescent 
wave/propagating  wave  solution.  It  follows  that  although  Collins’  technique  can  be  extended  to  higher  orders  along 
a  truly  range-independent  interface,  there  is  no  way  to  introduce  the  higher-order  boundary  conditions  at  the  stair 
steps.  The  new  theory  based  on  the  FW  transformation  must  then  be  used. 

ppp  For  “off-shell”  waves,  the  magnitude  of  the  wave  vector  does  not  need  to  equal  k  =a>lc  as  demanded  by  the 
Helmholtz  equation  (or  its  generalization  for  the  variable  density  problem).  Such  solutions  are  allowed,  because  the 
higher-order  PE  has  many  extra  derivatives  (compared  with  the  Helmholtz  equation),  and  it  ceases  to  operate 
precisely  like  the  standard  wave  equation.  By  contrast,  an  “on-shell”  wave  has  a  wave  vector  with  a  magnitude  that 
is  compatible  with  the  given  Helmholtz  equation. 

qqq  In  this  report,  the  terms  “waveguide”  and  “duct”  are  more  or  less  used  interchangeably.  For  some  authors, 
waveguides  are  associated  with  an  arrangement  of  impenetrable  interfaces  that  confine  and  direct  the  propagation  of 
a  wave  and  ducts  with  environmental  profiles  that  achieve  a  similar  effect  without  the  presence  of  interfaces. 


(zero)  speed  of  secondary  waves  in  the  fluid  create  a  (dimensionless)  PE  expansion  parameter  of  one  (see 
Section  7.2). 


For  an  overview  and  comparison  of  available  methods  for  propagating  classical  fields  propagating 
through  a  ducted  environment  in  the  presence  of  rough  interfaces,  see  Table  3.  Tappert’s  and  Collins’ 
approaches  are  discussed  in  Sections  2.2,  6.2.1,  and  6.2.2.  Bremmer’s  approach  is  discussed  in  Ref.  20. 
Modes  with  rough  boundaries  have  been  extensively  discussed  by  Kuperman,  Schmidt,  and  collaborators 
[65,  100-104],  The  approach  used  is  a  hybrid  between  mode  theory  and  perturbative  rough  surface 
scattering  theory.  The  use  of  impedance  boundary  conditions  to  characterize  propagation  through 
waveguides  with  rough  boundaries  has  been  examined  by  Berman  [105,  106]. 


Table  3  —  The  Optimal  Use  of  the  Various  Approaches  for  Modeling  Downrange 
Propagation  in  a  Ducted  Environment 


Underlying 

Approach 

Tappert’s  approach 

(uses  new  field  variable  u  =  A/Vp) 

Density  changes  as  a  function  of  the  range 
and  depth  really  are  gradual 

"0 

&3 

G3 

cr 

Stair  Step  PE  (Collins  [1991]) 

(use  range  independent  theory  +  conserve  energy  at 
interface  steps;  use  evanescent  solutions  to  eliminate 
Gibbs  oscillations) 

Interface  is  allowed  to  vary  but  there  is  little 
wavelength-scale  roughness  (density  jumps  allowed) 

May  be  adequate  for  wavelength-scale  roughness 

o 

o' 

Theory  based  on  the  Foldy-Wouthuysen 
transformation 

A  good  general  theory  that  can  credibly  be  applied  to  both 
the  scenarios  above 

m 

(developed  systematically  using  canonical 

Stochastic  Problem 

X3 

C 

transformations;  can  be  extended  to  higher 

If  2nd  order  correction  is  needed 

03 

f  1 

orders) 

Quasi-planar  penetrable  rough  interfaces  (Bragg  scattering 
and  density  jumps  allowed) 

12 

Double  Sweep 

(Bremmer  Series:  uses  full  wave  solution 
locally  at  the  interface;  uses  PE  to  propagate 
between  vertical  interfaces) 

ft 

We  are  interested  in 
backscatter 

ft 

Vertical 

interfaces 

Coupled  Modes 

ft 

ft 

Full  Wave 

These  become  numerically  expensive  unless 
the  range  dependence  is  very  modest 

For  aspects  of  the  stochastic  rough  surface 
component  of  the  approach,  see  Kuperman, 
Schmidt. 

Very  large  jumps  in  the  physical  parameters 

[rough  surfaces  are  treated  by  stochastic  theory  and  this 
component  of  the  range-dependence  effectively  goes 
away;  other  range  dependence  (e.g.,  from  deterministic 
large  scale  features)  is  modest  -  otherwise  the  approach 
becomes  very  expensive] 

Impedance  boundary  conditions 
(Berman) 

Range-dependence  is  stochastic 

Applies  to  backscatter 

7.  SIMILAR  EFFECTS  IN  THE  ELECTROMAGNETIC  AND 
ELASTODYNAMIC  PROBLEMS 

This  section  adapts  the  formal  approach  developed  for  the  acoustic  field  with  varying  density  to 
electromagnetic  and  elastodynamic  (i.e.,  elastic)  wave  propagation.  These  results  will  be  very  useful 
since  there  are  currently  unresolved  issues  in  applying  such  vector  field  PEs  to  tilted  interfaces,  and  the 
results  have  many  possible  applications,  including  fiber  optics,  radio/radar  (these  involve  electromagnetic 
fields),  and  seismology  (which  involves  elastic  waves).  Electromagnetic  fields  are  considered  in  Section 
7.1  and  elastic  waves  are  examined  in  Section  7.2. 
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The  discussion  below  sets  up  the  problem  by  deriving  the  state  space  equation  and  taking  a  brief  look 
at  the  transformations  connecting  the  familiar  full-wave  fields  with  the  associated  auxiliary  fields  that  are 
propagated  by  the  PE.  Full  development  of  the  theory  for  the  sake  of  practical  implementations  will  not 
be  pursued  below.  However,  beginning  with  the  state  space  equation  and  the  transformation  between  the 
two  fields  (i.e.,  between  PE  and  full-wave  fields),  it  should  be  a  strictly  mechanical  procedure  to  fully 
develop  the  theory.  Indeed,  the  first-order  PE  can  be  immediately  read  off:  it  is  simply  the  even  part  of 
the  state  space  equation. 

It  turns  out  that  jumps  in  electric  and  magnetic  permeability  (in  electromagnetic  theory)  and  in  the 
second  Lame  parameter  (in  the  theory  of  elastic  waves)  will  involve  issues  that  are  similar  to  those 
associated  with  density  jumps  in  the  acoustic  problem  (i.e.,  the  issues  associated  with  Bragg-scale 
vorticity). 

7.1  The  Electromagnetic  Field 

7.1.1  The  Foldy-Wouthuysen  Ansatzfor  the  Electromagnetic  Field 

The  discussion  at  the  beginning  of  Appendix  I  provides  general  guidelines  for  constructing  an  ansatz 
that  will  lead  to  a  state  space  equation  that  is  a  suitable  stalling  point  for  the  FW  transformation. 
Following  these  guidelines,  the  FW  ansatz  for  the  electromagnetic  field  is  (in  MKS  units): 


0  = 


'  0T' 
\%t) 


=  -|  Ht±J%xxE 

Mo 


(122) 


where  £0  is  the  reference  electric  permittivity  and  jUQ  is  the  reference  magnetic  permeability.  As  always, 
the  x  -axis  denotes  the  downrange  direction.  The  subscript  T  indicates  that  the  given  vector  only 
includes  components  in  the  transverse  (i.e.,  y-z )  dimensions.  HT  is  the  projection  in  (transverse)  y-z 
space  of  the  magnetic  field  H  and  E  is  the  electric  field.  The  underlying  tilde  in  O  serves  to  remind  us 
that  this  quantity  now  has  four  rather  than  two  components.  To  within  a  constant,  O  is  indeed  a  carrier 
of  the  flux: 

0'0  =  e;  ■0t-Zt-Zt=  5ave  •  X ,  (123) 

where,  as  before,  S  is  the  time-averaged  energy  flux  associated  with  the  field,  and  Sx  =  S  ■  x  is  its 
downrange  component. 

7.1.2  The  State  Space  Equation  for  the  Electromagnetic  Field 

This  ansatz  is  used  to  generate  the  state  space  equation.  This  equation  involves  the  following 
operators: 


AJk  = 

yjk  =  - 


J-V‘  Sjk  — l  V* 
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where  k,js(  1,2)  label  the  transverse  directions  y  and  z,  8jk  is  the  Kronecker  delta,  and 


Se  =  £-£0 

Sju  =  ju-M0 

k0  —  —  CO^£0/J0 

Co 

A  and  y  arc  2x2  matrix  operators.  To  remind  us  that  they  are  no  longer  scalars,  they  are  underlined. 
As  derived  in  Appendix  R.  1 ,  the  state  space  equation  for  the  electromagnetic  field  is 
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This  result  assumes  that  the  free  current  J  f  is  zero  (i.e.,  the  material  is  nonconducting).  It  would  not  be 
difficult  to  incorporate  the  correction  for  a  linear  isotropic  conductor  such  that  J  f  =  <jE  . 

Note  that  in  Eq.  (125)  we  have  included  the  definitions  for  E,  and  //  (Eqs.  (8)  and  (11)).  The  odd  and 
even  operators  are  now  4x4  matrices  defined  by  the  outer  products 

0  =  A®£ 

£  =  (A-2y}®i)' 

To  remind  ourselves  that  the  odd  and  even  operators  are  now  4x4  rather  than  2x2  matrices,  these 
operators  are  now  underlined  with  a  tilde  ~  .  Recalling  that  O  is  a  4-component  vector,  the  state 
equation  can  be  written  in  the  standard  form  (cf.  Eq.  (9)) 

iQ  =  (O  +  £  +  k0l®ri)& . 

dx 

Just  as  for  the  Dirac  equation  (and  harking  back  to  the  discussion  just  above  Eq.  (16)  in  Section  3.1),  the 
even  and  odd  properties  of  O  and  £  are  defined  by  their  commutation  properties  vis-a-vis  the  block 
diagonal 


V  0  ^2x2 J 


(126) 


To  avoid  confusion,  the  corresponding  one-way  Hamiltonian  will  not  be  denoted  by  H  ,  but  by  the 
underlined  2x2  matrix  operator  H  (its  matrix  character  is  denoted  by  the  underline  and  the  font  is  used 

to  clearly  distinguish  the  Hamiltonian  H  from  the  magnetic  field  H  ).  Although  it  is  now  a  4x4  matrix 
operator,  the  2-way  Hamiltonian  will  still  be  denoted  by  7i  . 
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7.1.3  The  Transformation  Between  the  Auxiliary  Field  0T  and  the  Magnetic  Field  HT 

Next,  we  examine  the  approximate  transformation  connecting  the  transverse  component  of  the  familiar 
full-wave  magnetic  field  FIT  with  the  corresponding  auxiliary  field  0T  that  is  associated  with  downrange 
propagation  by  the  PE.  As  shown  in  Appendix  R.2,  the  two  fields  are  approximately  connected  by  the 
equation 
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where 
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Note  that  A  is  similar  to  other  scalar  A  ’s  defined  above,  but  does  not  precisely  match  any  of  them.  To 
properly  use  Eq.  (127),  expand  in  Se/e0  and  X  to  get  a  finite-order  transformation.  The  downrange 

component  of  the  magnetic  field  H x  and  the  electric  field  E  can  be  deduced  from  0r  (or  more  generally 
0T  —  XT  if  the  uprange  field  %T  is  non-zero)  and  H ,  respectively,  using  Maxwell’s  equations™. 


Equation  (127)  (with  definition  in  Eq.  (128))  neglects  the  range-  and  transverse-  (e.g.,  vertical) 
dependence  of  £  and  U  locally  in  the  immediate  vicinity  of  the  point  in  space  where  the  transformation 

is  performed.  We  usually  only  need  to  transform  between  the  physical  field  HT  and  the  PE  field  0T  at 
the  endpoints  of  the  propagation.  The  endpoints  of  the  propagation  are  at  the  values  of  the  range  and 
transverse  coordinates,  where  the  source  and  receiver  are  located.  Elsewhere,  the  field  is  only  “passing 

through,”  and  only  the  auxiliary  field  0T  is  needed.  In  other  words,  Eq.  (127)  (with  definition  in  Eq 

(128))  applies  in  a  typical  implementation,  where  we  only  transform  between  HT  and  0T  at  a  source  or 
receiver,  and  additionally  have  £  and  LI  slowly  varying  in  the  immediate  neighborhood  of  the  source 
and  receiver.  Equation  (127)  also  assumes  that  the  material  is  a  linear  and  isotropic  such  that  JLlFl  =  B  , 
where  B  is  the  magnetic  induction.  As  with  the  PE  (Eq.  (125)),  it  is  assumed  that  the  free  current  J  f  is 
zero  (i.e.,  the  material  does  not  conduct). 


Note  that  this  transformation  does  not  involve  near-eigenvalues,  so  it  will  have  to  evaluated  even  in  the 
range-independent  case.  In  fact,  the  behavior  of  the  field  0T  can  be  quite  different  from  that  of  HT . 

However,  as  shown  at  the  end  of  Appendix  R.2,  if  we  have  very  shallow  grazing  angles  such  that  Vr  is 
effectively  “small,”  then  we  have  a  “minimal  correction”: 


0T  =(ju/£)'/4Ht  +0 


( 


V 


(129) 


rrr  E.g.,  Eq.  (R.6)  (derived  using  Maxwell’s  equations)  gives  Hx  °c  Vr  •  ( 6r  - %T ) ,  and  we  also  have  V x  II /icoe  =  E 
(assuming  that  the  conductivity  is  zero);  see  Eq.  (R.l)  for  a  list  of  Maxwell’s  equations. 
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For  almost  all  materials  //  ~  jUQ ,  and  so  fl  ■  effectively  acts  like  a  constant  that  cancels  at  the  endpoints. 
In  this  case,  the  minimal  correction  is  0T  =  (  X/ £)  '  HT . 

7.2  The  Elastodynamic  Field 

7.2.1  The  Foldy-Wouthuysen  Ansatzfor  the  Elastic  Field 

The  basic  environmental  parameters  that  will  appear  in  this  treatment  of  the  elastodynamic  problem 

SSS 

are 

u  =  displacement  vector  co  =  frequency  p  =  density 
?i.u  are  Lame  parameters 

cs  =  Jj;  =  speed  of  shear  waves  cp  =  speed  of  pressure  waves 

fij=  stress  tensor  =  +ju(y'uJ  +  V-V)  (130) 

Aq./Jq,  fi0  are  reference  values  for  the  Lame  parameters  (usually  p0=  ft0) 
p0  is  reference  density 

s  _  [mo  P  _  I  A)+2Mo 

L0  V  A,  0  V  P«  • 

Once  again,  we  use  the  discussion  at  the  beginning  of  Appendix  I  to  guide  us  in  constructing  an  ansatz 
that  will  lead  to  a  state  space  equation  that  is  a  suitable  starting  point  for  the  FW  transformation.  The 
appropriate  ansatz  for  elastodynamic  waves  is 
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(131) 


Equation  (131)  must  be  multiplied  by  ^j(Op{]cs{)  to  form  a  true  carrier  of  the  flux. 

7.2.2  The  State  Space  Equation 

The  state  space  equation  for  the  elastodynamic  field  will  also  involve  a  number  of  additional 
parameters  derived  from  the  environmental  parameters  in  Eq.  (130): 
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sss  Note  that  dispersive  effects  take  the  form  of  complex  components  to  environmental  parameters  such  as  the 
density  and  the  Lame  parameters  (i.e.,  the  sound  speeds). 
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These  quantities  are  used  to  define  a  set  of  operators  that  appear  in  the  state  space  equation.  Some  will 
play  a  familiar  role  and  are  labeled  accordingly: 


AJk  = 


+%»%(**  -  s°js°k ) + vkT/NjT ) 
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while  others  will  play  entirely  new  roles: 
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(133) 


Note  that 

k,j  g  (1, 2)  -  label  transverse  directions  y  and  z 
k,j  =  0  -  labels  downrange  direction  x 


As  discussed  in  Appendix  S.  1.2.1  to  use  straightforward  5  -function  bifurcation  at  an  interface,  we  will 
need  to  make  the  change  of  variable: 


However,  in  the  current  treatment,  we  will  stick  with  the  familiar  Lame  parameters  A  and  U  . 


As  derived  in  Appendix  S.1.1,  the  state  space  equation  for  the  elastodynamic  field  is 
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Note  the  familiar  2x2  matrices  E, ,  Ij  and  the  unit  matrix  1  have  been  labeled,  as  is  the  new  matrix 

.  (o  n 

T  =  r/g  = 

1  0 

v1  u7 

In  outer-product  notation  (see  the  treatment  of  the  electromagnetic  field  in  Section  7.1),  the  odd  and  even 
operators  are  given  by 

O  =  +  k'®t  |  outer  product  defines  6x6  matrices 

£  =  (A-2y)  ®t/  +  (/c-2/})®  lj  A ,y,K  and  /?  are  3x3  matrices 


Now,  the  tilde  reminds  us  that  these  are  6x6  matrices.  “Even”  and  “odd”  are  defined  as  commutation 
and  anti-commutation,  respectively,  vis-a-vis  the  block-diagonal 

fl3x3  0  ^ 


Similarly,  we  have  the  six-component  vector: 


o  = 


fe" 


and,  as  always,  we  can  write  the  result  in  standard  form: 

.90 


i-^  =  (0  +  £  +  k’\®ri)<b 


In  Eq.  (132),  there  appears  a  new  kind  of  expansion  parameter: 
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To  understand  the  meaning  of  this  parameter,  note  that  PEs  not  only  select  out  a  preferred  axis  (the 
downrange  x  -axis),  but  they  also  select  a  preferred  wave  number  k0  (i.e.,  the  overall  reference  wave 

number).  For  the  elastodynamic  waves,  it  is  convenient  to  choose  as  the  preferred  wave  number  kQ  the 
reference  wave  number  for  the  shear  waves  k$  .  The  expansion  parameter  (Eq.  (135))  is  intrinsic  to  the 
elastic  PE,  and  it  reflects  the  discrepancy  between  the  preferred  wave  number  and  the  reference  wave 
number  associated  with  the  primary  (or  pressure  or  p-)  waves:  k/’ .  It  has  much  in  common  with  the 
expansion  parameter  associated  with  the  acoustic  (in  this  example  constant  density)  PE 
JU  —  /t2)  =  ^l  —  (c0/c)~j,  which  reflects  the  discrepancy  between  the  local  wave  number  k  and  the 

reference  wave  number  k0  .  However  unlike  /.l ,  the  expansion  parameter  (Eq.  (135))  is  even  present  for 

a  downrange  plane  wave  (i.e.,  one  without  any  transverse  (y,z)  -dependence)  embedded  in  a  completely 

homogeneous  medium.  This  is  the  first  time  we  have  encountered  a  PE  parameter  that  does  not  vanish  in 
this  basic  case.  Such  expansion  parameters  will  occur  whenever  the  PE  is  applied  to  a  field  that  has  more 
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than  one  characteristic  wave  number.  Since  the  p-wave  speed  is  generally  around  two  times  the  s-wave 
speed,  the  presence  of  the  term  (Eq.  (135))  suggests  that  the  PE  for  the  elastic  solid  will  tend  to  require 
fairly  high  orders.  Finally,  it  is  worth  noting  that  in  the  presence  of  a  fluid-solid  interface  the  expansion 
parameter  (Eq.  (135))  goes  to  1  (with  the  usual  conventions;  other  conventions  will  cause  even  more 
severe  problems)  and  this  limit  is  consequently  quite  problematical  for  the  PE.  The  reasons  for  this  are 
discussed  in  footnote  yyyyy  in  Appendix  S.  1.2.2.  (More  generally,  a  number  of  subjects  addressed  in  this 
paragraph  are  examined  in  further  detail  in  Appendix  S.l.2.2.) 

7.2.3  The  Transformation  Connecting  the  PE  Field  %  to  the  Displacement  Vector  u 


The  endpoint  correction  for  the  elastic  PE  (tying  the  familiar  full-wave  displacement  vector  u  to  the 
corresponding  auxiliary  field  %  that  is  associated  with  downrange  propagation  by  the  PE)  is  quite 
difficult  to  obtain,  and  for  the  present  puiposes  as  well  as  for  most  conceivable  future  applications,  the 
first-order  (in  Vr )  is  sufficient.  This  result  is 
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This  result  is  derived  in  Appendix  S.2. 


When  X,jU  and  p  are  only  locally,  hut  not  globally,  constant,  then  Vr  is  no  longer  an  observable 

(i.e.,  V  0 )  even  in  the  range -independent  case,  so  there  is  no  cancellation  of  endpoint 

contributions  in  a  range-independent  environment,  and  of  course  no  near-cancellation  in  a  weakly  range- 
dependent  environment.  This  is  similar  to  the  electromagnetic  case.  However,  since  for  shallow  grazing 

angles  Vr  is  “small”  and  the  endpoint  contribution  does  not  accumulate,  it  may  be  possible  to  get  away 

using  the  0th  order,  but  use  of  the  first  order  in  Vr  is  the  safer  practice.  Note  that  the  second  order  (i.e., 

O  (V/  ) )  is  the  lowest  order  contribution  in  Vy.  when  one  assumes  that  the  medium  parameters  are 

locally  constant  for  the  electromagnetic  and  acoustic  fields,  so  the  first-order  effect  appearing  in  Eq.  (137) 
is  an  unusually  big  endpoint  effect  associated  with  transverse  differentiation.  Note  that  in  considering 
“minimal”  endpoint  corrections,  we  have  generally  kept  first  order  in  jumps  in  the  medium  parameters. 
Thus,  it  is  best  to  consider  the  expression  above  as  a  “minimal”  endpoint  correction  much  like 

\J~PoJp  ~  1  —  A//2Pii  in  the  two-fluid  acoustic  problem. 


This  completes  our  examination  of  the  PE  for  vector  fields. 
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8.  SUMMARY  AND  FUTURE  TRENDS 

In  this  section,  the  core  findings  of  this  report  are  summarized  in  Section  8.1,  and  finally,  desirable 
future  extensions  of  this  research  are  explored  (Section  8.2).  The  latter  includes  both  plans  to  implement 
the  new  formalism  and  extensions  of  the  theory. 

From  a  practical  point  of  view,  the  chief  goal  of  the  effort  outlined  in  this  work  has  been  to  tie  the 
acoustic  PE  technique  to  rough  surface  scattering  theory.  As  the  surface  roughness  extends  through  the 
wavelength  scale,  it  behaves  like  a  diffraction  grating  and  leads  to  what  is  called  Bragg  scattering.  This  is 
a  nontrivial,  phase-sensitive  problem  that  involves  theoretical  and  computational  challenges  that  go 
beyond  those  found  in  problems  to  which  the  PE  is  most  typically  applied.  Although  there  has  in  the  past 
been  some  success  in  applying  the  PE  to  impenetrable  rough  surfaces,  this  record  has  until  now  not  been 
matched  for  penetrable  rough  surfaces.  In  this  report,  the  FW  transformation  has  been  used  to  design  a 
PE  that  addresses  this  challenge.  The  paradigm  employed  is  based  on  the  nonrelativistic  theory  of  the 
quantum  Lamb  shift,  where  a  PE  (the  Schrodinger  equation)  was  used  to  model  a  field  near  a  rough 
surface  (the  world  line  of  the  hydrogen  nucleus  advected  by  vacuum  fluctuations).  The  applicability  of 
this  technique  to  the  acoustic  problem  was  established  by  examining  the  direct  classical  analogy  to  the 
atomic  Lamb  shift.  Then,  the  technique  was  extended  to  describe  Bragg  scattering  from  a  rough  interface 
where  the  density  jumps  (such  as  is  common  at  the  water-sediment  interface).  The  PE  derived  in  this  way 
exploits  higher-order  boundary  conditions  to  buffer  the  density  discontinuity  in  a  manner  precisely 
dictated  by  the  formalism. 

The  results  of  this  work  complement  the  current  state-of-the-art  PE,  and  suggest  a  promising  new  line 
of  development.  This  approach  is  particularly  well-suited  for  addressing  a  variety  of  important  scattering 
problems-particularly  those  where  the  density,  electric  permittivity,  magnetic  permeability,  or  the  Lame 
parameters  vary  rapidly  as  a  function  of  the  transverse  coordinates,  while  the  range  dependence  involves 
spectral  scales  down  to  the  Bragg  wave  number.  This  scenario  includes  rough  interfaces  where  these 
parameters  jump  such  as  the  ocean  bottom. 

8.1  Summary 

The  new  PE  applicable  to  a  rough  interface  where  the  density  jumps  has  the  most  immediate  practical 
relevance  to  the  field  of  underwater  acoustics,  and  it  therefore  constitutes  the  core  of  this  report.  The  key 
equations  needed  to  implement  this  PE  are  presented  by: 

•  Equation  (95),  the  Hamiltonian  in  the  half  spaces  (which  can  be  resummed  to  give  the  familiar 
Hamiltonians  in  the  half  spaces,  but  the  error  terms  will  involve  powers  of  the  new  expansion 
parameters  A  and  2^-these  parameters  are  given  by  Eq.  (87)), 

•  Equation  (97),  the  transformation  good  in  the  half  spaces  that  takes  us  from  the  auxiliary  field 
associated  with  the  PE  %  to  the  physical  pressure  field  A  , 

•  and  Eq.  (120),  the  second  order  O  [A1  j  boundary  conditions  applicable  along  a 
density/compressibility  jump. 

The  method  for  evaluating  derivatives  at  the  interface  described  in 


Fig.  10  is  needed  for  a  numerical  implementation  of  the  new  deterministic  theory. 

Figure  19  traces  the  development  of  this  core  result.  These  topics  are  covered  in  Sections  3.1,  3.3,  5.1, 
5.3,  and  6.1  of  this  report.  The  mechanics  of  discretizing  the  problem  are  discussed  first,  and  it  is 
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established  that  the  downrange  stepping  procedure  requires  that  boundary  conditions  be  specified  on  the 
interface.  The  FW  transformation  is  used  to  generate  a  PE  for  the  auxiliary  field  that  carriers  the 
downrange  flux,  complete  with  the  needed  boundary  conditions.  When  this  approach  is  applied  to  an 
interface  where  the  density  jumps,  the  jump  in  energy  flux  becomes  buffered  in  the  PE.  (In  the  full-wave 
equation,  the  tangential  component  of  the  flux  jumps  at  an  interface  where  the  density  jumps.) 


The  mechanics  of  stepping  downrange,  and  the  related  need  for  boundary  conditions  (Section  2.3) 


'interface 


Boundary  conditions  used 
to  evaluate  difference  at 
the  orphaned  grid  points 


A.  Using  the  PE  to  step  downrange  such  that 
the  interface  always  falls  right  on  the  grid. 


B.  Using  boundary  conditions  in  finite  difference 
schemes  to  evaluate  derivatives  right  on  the  interface  and 
so  get  the  Hamiltonian  needed  for  downrange  stepping. 


Using  the  Foldy-Wouthuysen  transformation  to  obtain  boundary  conditions 


Ansatz: 

field  =  j 

energy  flux\ 
v=>  energy  flux  J 

FW  transformation: 

*-(i 

U  ■',)» 

Result: 

|  Scalar  PE  with  scalar  Hamiltonian  H  \ 

lx,  „  IXn 

~\  n  ~\  n 

dz  dz 


Rough  world  line  of  a  nucleus 
advected  by  vacuum  fluctuations 


Rough  boundary  between 
media  in  a  waveguide 


A.  Use  a  series  of  canonical 
transformations  to  generate  the  PE:  i.e.. 
the  Foldy-Wouthuysen  transformation 
adapted  to  acoustics  (Sections  2.1  and 
4.1). 


B.  The  Foldy-Wouthuysen 
transformation  generates  boundary 
conditions  along  discontinuities 
(Sections  2.2,  4.3  and  5.1). 


C.  Validate  the  approach  by  considering 
the  classical  field  phenomenon 
analogous  to  the  atomic  Lamb  shift 
(Sections  2.2,  3.3  and  4.3). 


A  new  suite  of  boundary  conditions  along  a  density  jump  for  the  higher  order 
PE.  These  effectively  buffer  the  density  jump  much  like  an  electron  cloud 
buffers  an  atomic  nucleus  (Section  5.1).  (Key  result ) 


Flux  => 

Flux  => 


interface 


Flux 


Flux 


interface 


Fig.  19  —  A  schematic  summary  of  the  most  important 
and  results  introduced  in  this  report 


As  noted  in  Fig.  19,  the  classical  Lamb  shift  places  our  formal  efforts  into  context  and  enhances  the 
credibility  of  the  results  (Sections  3.2,  4.1.  4.2,  4.3,  and  4.4).  The  key  result  is  the  classical  Lamb  shift 
induced  by  the  combined  effect  of  sound  speed  cusps  and  sound  speed/density  jumps  (Eq.  (105) 
complemented  by  Eqs.  (95)  and  (97)).  The  quasi-first-order  stochastic  boundary  conditions  for 
(separately)  a  sound  speed  cusp  (Eq.  (40)),  the  sound  speed  jump  (Eq.  (82))  and  a  sound  speed/density 
jump  (Eq.  (103))  provide  examples  of  the  classical  Lamb  shift,  and  when  compared  to  the  deterministic 
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quasi-first-order  boundary  conditions  (Eqs.  (31),  (72)  and  (99),  respectively),  they  indicate  that  the  new 
FW  term,  which  is  identified  with  “vacuum  polarization”  in  Section  4.2.1,  indeed  involves  a  kind  of 
smearing  associated  with  (imaginary  -  i.e.,  «i)  displacements  in  the  uprange/downrange  direction. 
Thus,  the  examination  of  the  classical  Lamb  shift  reinforces  the  notion  that  the  FW  transformation  can 
smear  a  singularity  -  even  in  the  deterministic  problem.  In  addition  to  the  field-theoretic  interpretation, 
the  new  FW  term  can  also  be  explained  in  geometrical  terms  by  tracing  it  back  to  the  effects  of  tilt  on  the 
normal  derivative  in  the  full-field  boundary  conditions  (Section  4.2.1). 

The  discussion  of  the  classical  Lamb  shift  is  further  reinforced  when,  in  the  formalism  for  the  acoustic 
problem,  it  is  used  to  create  a  toy  model  of  the  atomic  Lamb  shift  (Section  3.2.4).  The  classical  Lamb 
shift  is  connected  to  boundary  waves  (Section  4.4.4).  Curvature-induced  boundary  waves  are  induced  by 
the  new  FW  term  (Sections  5. 3. 3.4  and  5. 3. 3. 5).  There  is  a  connection  between  these  boundary  waves  and 
Biot-Tolstoy  boundary  waves  (Section  5. 3. 3. 5). 

Going  beyond  topics  mentioned  in  Fig.  19,  Section  5.2  addresses  the  important  technical  question  of 
what  to  do  when  the  formalism  seems  to  multiply  distributions,  Section  6.2  places  the  results  into  the 
context  of  other  work  in  the  field  (e.g.,  see  Table  3  for  an  overview),  and  Section  7  extends  the  basic 
approach  to  electromagnetic  and  elastic  fields. 

8.2  Future  Trends 

Now  that  the  underlying  formalism  has  been  developed,  a  number  of  interesting  extensions  are 
possible. 

8.2. 1  Direct  Applications  of  Results  in  this  Report 

A  researcher  applying  this  formalism  to  acoustic  modeling  could  begin  by  directly  applying  the  results 
developed  in  this  report. 

Acoustic  Lamb  shift.  An  initial  examination  of  this  effect  associated  with  cusps  in  the  sound  speed 
and/or  jumps  in  the  density  and  sound  speed  would  be  based  on  the  stochastic  quasi-first-order  O^A,  X) 

theory  reflected  in  boundary  conditions  (Eq.  (105)).  Such  an  effort  would  build  on  the  preliminary 
modeling  results  presented  in  Section  5.4.2.  These  results  could  be  expanded  to  include  the  effects  of 
classical  vacuum  polarization.  An  extensive  parameter  study  would  reveal  scenarios  where  the  classical 
Lamb  shift  could  play  a  significant  role  in  shallow  water  propagation.  Specifically,  a  closer  study  of 
localization  induced  by  sound  speed  cusps  would  be  very  worthwhile,  since  it  could  suggest  ways  to 
exploit  signals  trapped  by  this  effect  to  deploy  sonar  more  effectively. 

Acoustic  “vacuum  polarization”  within  the  water  column.  Previously,  quasi-first-order  O^A,/i\ 

theory  was  used  to  evaluate  the  effects  of  classical  “vacuum  polarization”  in  (deterministic)  long-range 
propagation  in  deep  water.  It  was  assumed  that  range  dependence  is  induced  as  the  sound  speed  profile  is 
advected  by  internal  waves.  Wurmser  et  al.  assumed  single-scale  internal  waves,  but  Frank  Henyey 
(APL-UW)  has  pointed  out  that  a  power  law  is  much  more  realistic  and  the  effect  should  be  larger  for 
such  a  power  law  [107],  Furthermore,  Henyey  has  also  suggested  that  classical  “vacuum  polarization” 
might  be  significant  in  the  context  of  bubble  clouds,  since  the  scales  of  the  bubbly  medium  and  acoustic 
wavelength  are  closely  matched  [107].  These  hypotheses  should  be  examined  in  the  future. 

Bragg  scattering.  Initially,  second-order  O (/t:  j  theory  could  be  used  to  model  scattering  from  a 
deterministic  rough  interface  characterized  by  a  density  and  compressibility  jump.  This  study  would  be 
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based  on  boundary  conditions  (Eq.  (120)),  and  it  will  proceed  with  the  problem  of  scattering  from  the 
ocean  bottom  in  mind.  It  would  be  useful  to  benchmark  the  results  against  finite  difference  solutions 
and/or  against  controlled  experiments  (perhaps  using  scale  models).  Since  the  ratio  of  the  jump  in  the 
density  relative  to  the  reference  density  is  an  expansion  parameter  and  the  theory  has  initially  been 
restricted  to  low  orders,  initial  studies  should  begin  by  considering  density  jumps  that  are  somewhat 
smaller  than  the  factor  of  two  jump  typical  for  a  water-sand  bottom.  This  will  allow  a  preliminary, 
qualitative  investigation  of  subcritical-angle  penetration  into  the  ocean  bottom.  The  theory  could  also  be 
used  to  examine  the  effects  of  layered  bottoms  on  the  acoustic  field. 

8.2.2  Extensions  of  the  Results  in  this  Report 

Straightforward  extensions  of  the  formalism  derived  in  this  result  can  lead  to  several  promising  new 
avenues  of  research. 

Higher-order  theories.  Once  the  second-order  O  (22  )  version  of  the  theory  is  fully  implemented  and 

understood,  then  the  higher-order  theory  (e.g.,  third-order  C)(  /t5  )  theory)  can  also  be  developed  and  used 

in  a  more  definitive  study  of  subcritical  angle  penetration  into  the  ocean  bottom.  At  some  point  in  the 
future,  the  formalism  could  also  be  extended  to  include  two-dimensional  surfaces  embedded  in  the  full 
three-dimensional  space. 

As  noted  in  the  final  two  paragraphs  of  Section  6.1.5,  second/third-order  hybrid  O (22,2)  and  the 
pure  third-order  O  (/t\  2)  theories  in  both  their  deterministic  and  stochastic  versions  will  be  interesting 

and  worthwhile,  as  will  consideration  of  the  stochastic  version  of  the  basic  O  (A2  j  theory.  The 
stochastic  theories  will  involve  both  “losses”  due  to  Bragg  scattering  and  Lamb  shift-type  effects.  The 
0(2)  terms  introduce  the  classical  equivalent  of  the  vacuum  polarization  effect  known  in  atomic 
physics. 

Variable  densities.  At  some  point  in  the  future,  the  possibility  that  p  varies  in  the  half  spaces  should 

be  allowed,  and  consideration  given  to  the  possibility  that  there  may  be  steps  in  Vr/?  and  in  at  the 

interface.  Such  singularities  will  lead  to  new  stochastic  effects  similar  to  the  classical  Lamb  shifts 
discussed  above. 

Other  fields.  The  new  approach  based  on  the  LW  transformation  has  also  been  extended  to  elastic  and 
electromagnetic  fields.  It  would  be  best  to  use  these  results  at  first  to  consider  media  without  interfaces, 
and  then  expand  the  approach  to  consider  interfaces.  The  elastic -wave  theory  can  be  developed  with  an 
eye  toward  seismic  and  seismo-acoustic  applications.  It  remains  to  be  seen  whether  this  theory  can  be 
applied  to  a  fluid-elastic  boundary  such  as  appears  along  an  ocean  bottom  consisting  of  an  interface 
between  water  and  solid  rock  (e.g.,  see  footnote  yyyy  in  Appendix  S. 1.2.2). 

Data.  Linally,  all  of  these  versions  of  the  theory  will  have  to  be  thoroughly  compared  to  data. 
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A  Appendix:  %  as  a  carrier  of  downrange  flux 


It  is  shown  in  [1]  that  the  FW  procedure  by  design  ties  the  conservation  rule  for 
J  dRT  \x\  to  the  conservation  of  the  downrange  flux,  and  so  to  the  conservation  of  energy. 

This  connection  also  helps  us  understand  the  relationship  between  the  auxiliary  field  % 
and  the  pressure  field  A  .  Here  we  examine  the  case  of  constant  density.  It  can  be  shown 
that  x  ~  y]%0 '  ^  (see  the  second  to  the  last  paragraph  of  Section  2.1,  the  discussion 
around  equation  (3.14)  in  Section  3.1  and  most  of  all  Appendix  D.l),  which  can  be 
rewritten  in  the  suggestive  form  X  =  ^  (where  recall  functions  of  operators 


are  understood  as  expansions).  To  further  illuminate  the  interpretation  of  X  as  a  kind  of 
carrier  of  flux,  take  \x[  =  X*  Zx  (h'^A)  (h'^a),  split  it  in  halves,  integrate  over 

transverse  space,  and  integrate  the  halves  by  parts  each  in  the  opposite  direction  from  the 
other  to  get 


J dRT  \xf  °c  J dRT  y2(HA)*  A +  y2  A* (HA)  =  j dRr  Im [/f  •  (dd/dx)]  «  j dRrSx  (A.l) 


( Sx  is  the  downrange  flux).  In  the  eigenvalue  problem  (i.e.,  when  the  duct  is  range 
independent),  the  eigenfunctions  X„  form  a  basis  for  the  solution  set:  J  =  ^cnX„  (c„  are 


constants),  and  each  eigenfunction  is  (to  within  a  phase)  the  actual  square  root  of  the 
corresponding  (downrange)  flux.  (The  basis  can  be  made  orthogonal  using  the  Graham- 
Schmidt  orthogonalization  procedure;  see,  for  example,  reference  [108].)  For  weak  range 
dependence,  we  can  decompose  the  field  A  into  its  corresponding  set  of  eigenfunctions 
and  replace  the  operator  H  with  eigenvalues  kx ;  and  similarly  we  can  substitute  -Jr)  for 

J7J .  If  there  is  no  range  dependence,  there  is  no  mode  coupling  and  the  endpoint 
transfonnations  from  A  to  X  at  the  beginning  and  X  back  to  A  at  the  end  precisely 
cancel.  If  the  coupling  is  weak  in  the  sense  that  most  of  the  coupling  is  between  nearby 
eigenfunctions  (nearby  means  the  eigenvalues  are  close),  the  endpoint  transformations 

will  involve  factors  of  Jkx/k'x  (unprime  is  from  the  A  — >  x  transformation,  prime  is 
from  the  X  ^  transformation)  and  the  transformations  nearly  cancel.  In  such  typical 
cases,  reasonably  good  answers  can  be  obtained  using  the  PE  to  directly  propagate  A  . 
Although  this  is  often  done,  it  is  good  practice  to  keep  in  mind  that  this  is  an 
approximation,  and  that  the  PE  really  propagates  an  auxiliary  field  X  ■ 
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B  Appendix:  The  place  of  backscatter  in  the  PE 
formalism 

The  parabolic  equation  converges  to  the  full  wave  solution  asymptotically.  Although  the 
expansion  is  of  ever-higher  order,  there  remains  a  small  but  finite  probability  of 
backscatter.  A  complete  description  of  propagation  through  a  range-dependent 
sound  speed  profile  must  take  this  into  account.  There  is  a  precedent  from  quantum  field 
theory109.  A  charged  particle  field  in  a  constant  uniform  electric  field  has  a  finite 
probability  of  initiating  particle-antiparticle  pair  production.  The  probability  of  pair 
production  goes  as  the  negative  exponential  of  the  reciprocal  of  the  field,  and  it  is 
therefore  non-perturbative.  Backscatter  for  the  acoustic  field  corresponds  to  pair 
production  in  a  quantum  field1 10,1 1 11 12. 

We  now  follow  the  procedure  starting  on  p.  193  of  reference  [109].  Unlike  the  electric 
field,  the  sound  speed  variation  is  automatically  decoupled  from  the  range  momentum 
component,  and  a  transformation  to  a  pure  time-dependent  gauge  is  unnecessary  (see 
equation  4-1 13  of  reference  [109]).  Therefore,  e2E2Xl  is  replaced  with  kin  ■  Now, 
locally  expand  f.l  through  second  order  (note:  for  very  strong  range  dependence,  this 
becomes  a  poor  approximation).  We  now  have  an  equation  mathematically  very  similar 
to  the  quantum  harmonic  oscillator  (see  reference  [109],  equation  4-115),  but  there  is 
now  an  extra  tenn  linear  in  the  canonical  coordinate.  This  term  comes  from  the  first 
order  of  the  Taylor  expansion.  The  effect  of  a  linear  term  added  to  the  Hamiltonian  of  a 
hannonic  oscillator  is  simply  to  shift  the  zero  point  of  the  oscillator.  This  is  not 
physically  significant,  and  it  can  be  eliminated  by  redefining  the  zero  point  of  the  range. 
Finally,  we  arrive  at  the  result  for  bosons  undergoing  pair  production  (equation  4-1 19  in 

reference  [109])  with  eE  replaced  by  k0^ji/2  : 

Probability  _  k20fl  ^  (~l)s+1  _ 

3  2  ^ 

unit  volume  1 s 

Note  that  the  crucial  quantity  is  the  second  range  derivative  of  ju  =  1=f~  ■  Also  note  that  in 
the  limit  as  the  range  dependence  disappears  (i.e.,  ji  — >  0  ),  the  quantity  on  the  right  hand 
side  of  equation  (B.l)  goes  to  zero  faster  than  any  integer  power  of  ji .  In  other  words,  it 
is  an  essential  singularity  at  ji  =  0  ,  and  backscatter  lies  at  a  higher  order  than  any  finite 
order  of  the  parabolic  equation.  Equation  (B.  1)  also  suggests  a  technique  for  calculating 
the  backscatter.  The  intensity  of  the  (incoherent)  scattered  field  would  be  omni¬ 
directional  and  proportional  to  the  probability  density  given  in  (B.l)  above. 

Solutions  evanescent  in  the  downrange  direction  play  a  prominent  role  when  there  is 
ducting  in  the  direction  transverse  to  the  original  downrange  direction  (for  example,  in 
geoacoustics,  when  a  downward  propagating  field  can  hit  a  salt  dome  that  is  elongated  in 
the  transverse  direction  and  so  acts  as  a  transverse  duct),  or  as  transients  near  a  spherical 
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wave  source1".  These  waves  do  not  produce  an  energy  flux  in  the  downrange  direction. 
Since  the  expansion  parameter  k_/k0  >  1 ,  these  solutions  are  not  compatible  with  the 

usual  Taylor  series  expansion  of  the  PE  square  root  operator  (as  pursued  in  this  paper). 
Indeed,  these  solutions  along  with  other  more  conventional  near-transverse  modes  will 
render  the  expansion  techniques  discussed  in  this  paper  ineffective.  Like  backscatter, 
evanescent  modes  only  have  weak  curvature-induced  coupling  to  downrange-propagating 
waves,  and  this  coupling  goes  to  zero  faster  than  any  finite  order  of  the  PE  expansion.  In 
fact,  pair  production,  the  mechanism  that  gives  rise  to  backscatter,  is  also  responsible  for 
coupling  propagating  waves  to  evanescent  pairs.  Once  again,  we  see  that  these  effects 
can  be  neglected  in  many  problems  of  physical  interest,  including  many  typical  ducted 
propagation  problems.  In  Sections  2.2  and  6.2.2,  it  is  noted  that  Michael  Collins  exploits 
the  existence  of  non-physical  evanescent  solutions  in  order  to  eliminate  Gibbs’ 
oscillations.  Issues  related  to  evanescent  waves  are  fully  discussed  in  the  work  of 
Fishman  et  al.  in  reference  [25]  and  in  their  subsequent  work  currently  under  preparation. 

Note  that  like  the  PE,  the  pair-production  calculation  selects  a  preferred  direction  and 
specifically  postulates  weak  range  dependence.  Perturbation  theory,  which  leads  to  the 
Born  series,  begins  with  a  fundamentally  different  assumption.  It  is  based  on  an  iterative 
procedure  that  already  at  the  1 st  order  of  the  iteration  intrinsically  generates  solutions  that 
involve  the  full  Fourier  transform  space  of  the  spatial  dependence  of  the  environmental 
parameters.  The  full  Fourier  transform  space  automatically  incorporates  backward 
propagating  causality,  and  so  in  the  context  of  the  Born  series,  backscatter  becomes  a  1st 
order  effect.  The  different  properties  of  these  expansions  arise  because  we  are  expanding 
about  fundamentally  different  limits. 

Finally,  recalling  the  qualitative  discussion  in  Section  2.3,  we  can  see  how  the  basic 
insights  in  this  appendix  generalize  to  the  interface  problem.  There,  it  was  noted  that 
when  a  surface  is  single  valued,  has  small  slope,  and  furthermore,  the  incoming  field 
consists  of  spectral  components  with  shallow  grazing  angles,  then  backscatter  is  a  subtle 
effect  related  to  the  surface  curvature  ( f  for  an  interface  given  by  z  =  f{x) ).  In  other 

words,  f  roughly  takes  the  role  of  ji  when  we  go  from  the  volume  scattering  to  the 
interface  scattering  problem.  Backscatter  goes  to  zero  faster  than  any  finite  order  PE  as 
/-»  0- 


ttt  Reflecting  the  fact  that  the  Klein-Gordon  equation  is  a  hyperbolic  partial  differential  equation  (rather  than 
an  elliptic  one),  the  equivalent  quantum  mechanical  solution  must  decay  in  at  least  one  spatial  (i.e., 
transverse)  dimension  in  addition  to  being  evanescent  in  the  downrange  (i.e.,  temporal)  direction. 
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C  Appendix:  Using  the  Foldy-Wouthuysen 

transformation  to  derive  the  basic  (constant  density) 
acoustic  parabolic  equation 


This  appendix  provides  the  key  features  of  reference  [1].  These  are  reproduced  here  for 
completeness.  The  Foldy-Wouthuysen  transfonnation  is  used  to  derive  a  parabolic  in  a 
fluid,  where  the  density  is  everywhere  the  same  (such  as  the  water  column  in  the  ocean). 
This  is  the  most  basic  application  of  the  Foldy-Wouthuysen  transformation  to  a  classical 
field,  and  it  establishes  the  paradigm  to  be  used  in  all  the  subsequent  formal  development 
in  this  paper. 

Section  C.l  recasts  the  problem  so  that  uprange  and  downrange  propagation  are  clearly 
identified.  The  2nd-order  scalar  Helmholtz  equation  is  converted  to  a  lst-order  “state- 
space”  equation  involving  2-dimensional  vectors  operated  on  by  2x2  matrices  (Section 
C.  1 . 1).  The  components  of  the  2-dimensional  vectors  represent  uprange  and  downrange 
propagation.  A  basic  attribute  of  the  matrix  operator  and  its  connection  to  energy 
conservation  is  then  discussed  (Section  C.l. 2). 

In  Section  C.2,  the  Foldy-Wouthuysen  transformation  is  used  to  decouple  uprange  and 
downrange  propagation.  The  canonical  nature  of  the  transfonnation  is  discussed  (Section 
C.2. 1)  and  then  a  form  of  the  transformation  is  derived  for  the  range-independent 
problem  (Section  C.2.2).  This  result  is  good  to  infinite  order.  Next,  a  number  of  issues 
related  to  the  range-dependent  problem  are  examined  (Section  C.2. 3).  Most  significantly, 
a  general  iterative  procedure  for  implementing  the  Foldy-Wouthuysen  transformation  is 
derived.  This  basic  procedure  generates  a  perturbative  expansion,  and  it  will  be  used 
again  later  as  the  fonnalism  is  extended  to  other  problems,  such  as  acoustics  where  the 
density  varies,  electomagnetics  and  elastic  solid  waves.  The  Foldy-Wouthuysen 
procedure  is  then  explicitly  applied  to  the  range-dependent  (constant  density)  acoustic 
problem  (Section  C.2. 4).  The  proper  symmetry  properties  are  made  manifest  (Section 
C.2. 5)  and  then  the  physical  meaning  of  the  results  are  discussed  (Section  C.2. 6). 


C.l  The  vector  formulation  of  the  Helmholtz  equation 

In  Subsection  C.1.1  we  develop  the  ansatz  of  the  procedure  developed  by  Foldy- 
Wouthuysen,  and  then  in  Subsection  C.1.2  examine  an  important  attribute  of  the  structure 
of  the  equation  generated  by  the  ansatz:  the  pseudo-Hermiticity  of  the  matrix 
Hamiltonian. 

C.1.1  Converting  a  2nd  order  differential  equation  to  1st  order 
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The  acoustic  pressure  field  is  assumed  to  propagate  according  to  the  (constant  density) 
wave  equation  through  a  time-independent  environment: 

Note  that  V2  =  V  ■  V  where  once  again  as  in  Section  2.1,  V  =  (d/dx,  Vr )  with 
VT  =  (d/dy,d/dz)  the  gradient  in  the  transverse  direction. 


Each  frequency  co  can  be  considered  separately,  and  so  the  pressure  is  given  by 

P(x,RT,t)  =  \^Pta{x,RT,t)dO) 

Pa{x,RT,t)  =  RQ[A{x,RT)e-i0}'] 

where  the  complex  field  A(x,RT )  solves  the  Helmholtz  equation. 

^~4  +  Y2TA  +  k20n2A  =  0.  (C.2) 

ox 

As  in  Section  2.1  k0  =  Co/cq  ,  c0  is  a  reference  sound  speed,  and  n  {x,RT)  is  the  index  of 
refraction. 


In  solving  second  or  higher-order  differential  equations,  it  is  common  practice  to  reduce 
the  order  of  the  derivatives  by  adding  degrees  of  freedom.  Here,  we  go  from  a  second 
order  differential  equation  for  a  scalar  A  to  a  first  order  equation  for  a  vector  <h  .  The 
components  of  the  vector  must  be  linear  combinations  of  A  and  its  first  derivative 
dA/dx .  Following  [57]  (pp.  199-207),  we  make  the  usual  choice  (e.g.,  see  Section  3.1) 

(0\ 

U; 


where 
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This  is  an  ansatz,  which  has  been  used  previously  for  the  acoustic  problem  (see,  for 
example,  [1 13], [1 14]).  Later  in  Subsection  C.1.2,  we  will  see  that  <£>  is  closely  related  to 
the  downrange  energy  flux. 

The  Helmholtz  equation  can  be  rewritten  as 

ik0  (0-z)  =  Y2r(0  +  z)  +  kin 2  (0  +  Z),  (C.4) 

where  as  always  a  denotes  da/dx  .  Manipulating  the  definitions,  we  also  have  the 
equation  of  constraint 

ik0(e  +  x)  =  kl(d-z).  (C.5) 


Adding  and  subtracting  (C.4)  and  (C.5),  expressing  the  answers  in  matrix  form,  and  we 
have 


/'<£>  =  [k0J)  +  A  rj  +  A4]  , 


where  as  in  Section  2.1, 


and  as  always 


V2 

A  =  —  -  juk0 

2  K 

M  =  ±-(\-n2) 


"1  0^ 

v0  -1, 


'  o  r 
v-l  0, 


(C.6) 


The  “odd”  operator  O  =  A%  is  off-diagonal  and  couples  the  components  of  <f>  while  the 
“even”  operator  E  =  Aij  is  diagonal  and  does  not  couple  6  and  J . 

We  now  have  (cf.  equation  (3.4)) 

3<f> 

/L—  =  n<f>  (C.7) 

ox 


where 


178 


H(A)  =  O  +  £  +  k0ri 


(C.8) 


This  formal  result  is  the  same  as  the  quantum  mechanical  analog  with  the  reference  wave 
number  k0  replacing  the  mass.  This  equation  determines  the  behavior  of  <J>  as  a  function 
of  the  range. 

Note  that  (C.7)  is  nothing  more  than  the  Helmholtz  equation  rewritten  in  vector  form.  It 
has  previously  been  derived  in  references  [114]  and  [115],  where  it  is  written  in  slightly 
different  fonn.  This  result  is  exactuuu. 

Taking  equations  (C.7)  and  (C.8)  as  a  starting  point,  the  Foldy-Wouthuysen 
transformation  of  Section  4.3  in  reference  [57]  can  now  be  used  to  solve  the  problem. 


C.1.2  Energy  conservation  and  pseudo-Hermiticity 

Before  proceeding  to  a  solution  of  the  problem,  we  digress  to  further  examine  the 
structure  developed  up  to  this  point.  This  will  serve  to  provide  a  physical  intuition  for  the 
formal  results  above  as  well  as  motivate  what  comes  later.  It  will  be  shown  that  $  is 
related  to  the  energy  flux  and  that  a  simple  mathematical  property  of  the  Hamiltonian  H 
known  as  “pseudo-Hermiticity”  guarantees  that  the  energy  is  conserved  for  all  values  of 
the  range. 

To  introduce  the  concept  of  pseudo-Hermiticity,  it  is  first  necessary  to  define  Hermiticity. 
The  Hermitian  conjugate  of  the  wave  function  <f>  is  its  transpose  and  complex  conjugate: 


uuu  Also  note  that  there  are  a  number  of  other  ways  to  recast  the  Helmholtz  equation  as  a  first  order  matrix 
equation;  i.e.  there  are  a  number  of  “branches”  for  taking  the  square  root  of  the  operator.  As  pointed  out  in 
reference  [115],  the  Dirac  operator  would  be  another  choice.  However,  unlike  equation  (C.7),  the  Dirac 
equation  introduces  a  superfluous  non-physical  degree  of  freedom  (corresponding  to  quantum  mechanical 
spin).  There  are  a  number  of  other  choices,  including  those  corresponding  to  other  spin  states.  These 
introduce  extra  degrees  of  freedom,  which  are  not  present  in  the  acoustic  propagation  problem.  Equation 
(C.7)  is  appropriate  for  spinless  particles  such  as  pions  and  K-mesons,  as  well  as  for  the  acoustic  field, 
which  of  course  also  has  no  quantum  mechanical  spin.  (Incidentally,  we  will  see  later  that  classical 
electromagnetic  and  elastic  fields  will  automatically  introduce  vector  properties  into  the  Hamiltonian,  but 
the  resulting  vector  behavior  comes  in  addition  to  the  spins  being  discussed  here  and  the  two  classes  of 
phenomena  should  not  be  confused.) 
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<£'  =  (<E>‘)r . 


To  extend  the  concept  of  Hermiticity  to  an  operator  a  it  is  necessary  to  assume  the 
sturcture 

.  (C.9) 

The  Hermitian  conjugate  cx  is  defined  by  the  equation 

J (<bV<b) d2RT  =| ((£*I>)t  <h) d2RT  . 


The  Hermitian  conjugate  of  a  matrix  operator  turns  out  to  be  the  transpose  and  complex 
conjugate  of  the  matrix,  while  for  operators  containing  powers  of  the  differential  operator 
V"  ;  it  is  necessary  to  integrate  by  parts,  picking  up  a  minus  sign  for  every  power  -  i.e., 

after  the  necessary  n  integrations  by  parts  we  pick  up  a  factor  of  (-1)  .  It  follows  from 
these  definitions  that  the  Hermitian  conjugate  of  a  product  is  the  product  of  the  Hermitian 
conjugates  in  reverse  order  (i.e.  ( a/. 3)  =  [3cx  ).  (Note  that  it  is  here  where  we  implicitly 
assume  that  the  field  is  zero  at  the  boundaries.) 

Note  that  the  structure  (C.9)  is  required  specifically  so  that  the  concept  of  Hermiticity  can 
be  extended  to  the  differential  operator  Vr .  In  quantum  mechanics,  the  integral  would 
be  over  full  3-dimensional  space,  but  for  the  PE,  it  becomes  the  2-dimensional  transverse 
space  given  by  RT  .  This  is  consistent  with  the  notion  that  the  range  x  takes  the  place  of 
the  time.  Continuing  the  analogy  to  quantum  mechanics,  RT  is  the  coordinate  vector  and 
the  operator  -zVr  is  the  corresponding  momentum  PT . 


An  operator  a  is  Hermitian  if  a'  =  a .  The  definition  of  pseudo-Hermiticity  is  similar: 
a  is  pseudo-Hennitian  if  (rja)  =  r/a  where  ^  is  defined  in  equation  (C.6).  (Note  that 
pseudo-Hermiticity  is  Hermiticity  with  the  unit  matrix  replaced  by  the  metric  i) .  As  will 
be  shown  below,  the  physical  significance  of  this  metric  lies  in  the  fact  that  the  energy 
associated  with  the  two  components  of  <£>  propagates  in  different  directions.)  Since 
A  and  1)  are  Hermitian,  %  anti-Hermitian  (i.e.  ^  )  and  i]C,  =  —%ij ,  it  follows  that  'H 

defined  in  (C.8)  is  pseudo-Hennitian. 


This  implies  that 


dx 


d2RT 


\\L&riH<S>n\d2RT 
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dx  J 

//ch 

(C.10) 
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which  in  turn  leads  to  the  result 


d_ 

dx 


=  0. 


(C.ll) 


It  will  now  be  shown  that  this  equation  is  an  expression  of  total  energy  conservation.  (By 
total  energy,  it  is  meant  that  the  forward  and  backward  propagating  components  of  the 
field  are  both  included.) 

Consider 


where  5ave  is  the  time-averaged  energy  flux  and  p  is,  as  always,  the  density  (begin  with 
reference  [98]  (equations  64.6  and  65.4),  and  then  use  a  well  known  result  for  the  time- 
averaged  product  of  the  real  parts  of  two  complex  fields).  It  follows  that 

is  proportional  to  the  total  down-range  flow  of  energy.  Since  it  is  the  same  for  all  values 
of  the  range,  there  are  no  energy  sources  or  sinks,  and  we  have  shown  that  the  pseudo- 
Hermiticity  of  7 H.  guarantees  energy  conservation.  Furthermore,  the  physical 
significance  of  the  quantity  fh  //cp ;  taken  as  a  function  of  the  position  (x,  Rr) ,  has  been 
established.  It  is  the  time-averaged  energy  density  flux  in  the  direction  of  the  range  x  . 

C.2  The  basic  acoustic  (Sp  =  o)  Foidy-Wouthuysen 
transformation 


2k, 


\(ee-zz)d2RT 

j(A-d,A-(d,A)'  A)d% 


~-Pc«\ 


Im(^*Vyf) 


(C.  12) 
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2pcA(Sme-x)d2R 


d2RT 


Here  we  examine  the  actual  Foidy-Wouthuysen  transformation.  The  canonical  nature  of 
the  transformation  is  discussed  (Section  C.2.1)  and  then  a  form  of  the  transformation  is 
derived  for  the  range-independent  problem  (Section  C.2.2).  Next  a  general  procedure  for 
implementing  the  transformation  is  derived  (Section  C.2. 3),  and  then  applied  to  the 
range-dependent  acoustic  problem  (Section  C.2.4).  The  equation  is  made  manifestly 
symmetric  under  range  reversal  (Section  C.2. 5)  and  physical  interpretation  of  the  results 
are  discussed  (Section  C.2. 6). 
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C.2.1  The  use  of  canonical  transformation  to  solve  a  problem 


At  its  core,  the  Foldy-Wouthuysen  technique  exploits  a  procedure  familiar  from  classical 
mechanics  (e.g.,  see  references  [1 16]  and  [117]).  A  canonical  transfonnation  is  used  to 
simplify  the  problem,  in  this  case  to  decouple  the  forward  and  backward  propagating 
solutions.  The  canonical  problem  is  solved  in  the  transfonned  space,  and  the  final  answer 
is  transformed  back  into  the  original  space.  In  the  present  context,  an  additional  step  is 
added.  The  Foldy-Wouthuysen  ansatz  recasts  the  scalar  field  in  vector  fonn  (and  the 
scalar  operator  of  the  wave  equation  into  a  matrix  Hamiltonian  operator).  The  approach 
is  shown  diagrammatically  in  Figure  C.l. 


The  transfonnation  $  — >  $  and  'H  — >  'H  is  canonical  if  the  form  of  the  equation  is 
preserved,  i.e.: 


i  —  =  —  =  7Td>. 

ox  ox 


The  Foldy-Wouthuysen  transfonnation  is  a  canonical  transfonnation  where  7T  is 
diagonalized,  and  the  components  of  <f> ,  9  and  X  •>  represent  decoupled  backward  and 
forward  propagating  solutions  respectively. 

Physical  systems  tend  to  be  amenable  to  numerical  simulations  largely  because  they  obey 
physical  conservation  rules  (e.g.,  energy  conservation),  which  in  turn  force  the  related 
mathematical  equations  to  enjoy  numerical  stability.  To  maintain  this  source  of 
numerical  stability,  we  also  demand  that  our  new  canonical  equation  maintain  the 
conservation  rule  on  <£>  77<f> .  This  implies  that  the  transformation  must  be  pseudo-unitary 
(i.e.  if  <f>  =  t/<f>  then  U  r/U  =  i)  ),  which  in  turn  guarantees  that  the  total  energy  flux 
remains  equal  to  -(2 pc0)  1  J (&7f&}d2RT  .  As  demonstrated  by  the  argument  in 

Subsection  C.l. 2,  the  fact  that  this  quantity  remains  conserved  even  after  downrange 
propagation  implies  that  'H  must  be  pseudo-Hermitian.  (It  also  turns  out  that  'H  is 
Hermitian.  This  can  be  seen  as  follows.  Recall  that  7T  is  diagonal,  and  so  the  equations 
for  the  components  of  <J>  are  decoupled,  and  each  must  independently  conserve  energy. 
The  corresponding  components  of  the  diagonal  matrix  77  must  then  be  Hennitian,  and  so 
is  7 i  itself.  See  also  the  results  (3.1 1),  and  the  Hamiltonians  (C.3 1)  and  (C.35)  below  in 
this  appendix  to  see  this  principle  explicitly  at  work.) 
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) 
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“Transformed”  Space: 

Hamiltonian  is  a  diagonal  matrix 


Figure  C.l  -  The  strategy  to  be  employed  is  diagrammed  above.  The  ansatz  is  used  to  convert  the 
scalar  field  into  a  2-dimensional  vector  whose  components  are,  roughly  speaking,  carriers  of  the 
uprange  and  downrange  flux.  The  Foldy-Wouthuysen  transformation  is  used  to  reformulate  the 
problem  such  that  forward  and  backward  propagating  solutions  are  decoupled.  The  problem  is 
solved  in  transformed  space.  After  propagation,  the  physical  field  is  recovered.  For  many 
applications,  propagation  effects  dominate,  and  it  is  possible  to  drop  the  contribution  from  the 
transformations  at  the  endpoints. 


C.2.2  Example:  the  range-independent  sound  speed  profile 

We  now  illustrate  the  Foldy-Wouthuysen  technique  by  examining  the  case  where  there  is 

no  range  dependence  in  the  sound  speed  profde  (i.e.  //  can  depend  on  the  transverse 

coordinate  RT  ,  but  it  is  independent  of  the  range  x).  With  the  fonnal  substitutions 

n2  , 

- >  A  ,  in  — >  kn 

2m  0 

the  PE  problem  is  identical  to  the  non-relativistic  reduction  of  the  Klein-Gordon  equation 
in  the  absence  of  an  external  field.  The  latter  is  solved  in  reference  [57]  (p.  200).  The 
corresponding  Foldy-Wouthuysen  transfonnation  is 

<f>  =  eiS<P 
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s=n&W 


where 


0  1 
V1  Oy 


0(/l) 


0(A)  ^-^tanr1-^- 
2  kn  +  A 


(C.  13) 


(This  particular  form  follows  the  notation  of  reference  [57];  however,  in  Appendix  D.  1 
below,  when  we  examine  the  relation  connecting  x  and  A  for  the  downrange  problem 
(i.e.,  the  transformation  at  the  endpoints),  the  result  will  be  recast  in  a  form  more 
conducive  to  the  calculations  to  be  pursued  there  [see  equation  (D.5)].) 

The  form  of  S  was  obtained  heuristically  by  invoking  an  analogy  to  rotations  about 
coordinate  axes  commonly  used  in  quantum  mechanics,  while  the  value  of  0  given  in 
equation  (C.13)  comes  from  the  requirement  that  off-diagonal  elements  of  H  must  be 
zero.  Note  that  S  is  pseudo-Hermitian  and  e's  is  pseudo-unitary. 

The  relation  dA/dx  implies  that  H=  e's7ie~lS  and,  from  the  definition  of  H  given  in 
(C.8),  it  follows  that  es7i  =  7 ie~lS .  Finally,  this  leads  to 


H=  He~2iS  =  ?/k0  1  +  —  . 


(C.14) 


(The  factor  of  2  in  the  exponential  cancels  the  factor  of  l/2  in  0 .)  The  exponential 
reduces  to 
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which  is  easily  reduced  (e.g.,  using  the  symbolic  manipulation  program  like  Maple). 
For  a  forward  propagating  wave,  we  have 


vly 


X, 


where 


(C.15) 


Since  the  operator  on  the  right  hand  side  of  equation  (C.15)  is  (in  this  case)  independent 
of  x ,  this  is  an  eigenvalue  equation,  and  it  follows  that  x  niust  be  expressible  as  a  sum 

of  eigenvectors  of  A  (X  =  ^c„Za  )•  Therefore,  the  transformation  at  the  endpoints 


A  =  0  +  x  =  [  1 
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(C.16) 
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simply  involves  rescaling  each  eigenvector  by  a  constant,  and  so  A  also  satisfies 
equation  (C.  15).  This  is  the  standard  PE  for  a  range-independent  sound  speed  profile 
(first  introduced  in  equation  3.24  of  [15]).  Finally,  expanding  in  2A/k() ,  we  have 

-i^  =  yjLA  +  k,(\-n)A,  (C.17) 

ox  2  k0 

and  with  A  =  TV*1'1 ,  we  recover  the  basic,  lowest  order  PE,  as  it  appears,  for  example,  in 
[15]  (equation  1.21).  It  has  been  recognized  previously  in  [1 18]  that  this  equation  can  be 
obtained  using  the  Foldy-Wouthuysen  transformation.  (Also  note  that  to  obtain  this 
range-independent  result,  it  was  not  necessary  to  drop  the  contribution  from  the 
endpoints.) 

Thus  for  a  range-independent  sound  speed  profile,  the  Foldy-Wouthuysen  transformation 
recovers  the  various  well  known  forms  of  the  parabolic  equation.  Assuming  f.l  to  be 
slowly  varying,  dju/dx  is  a  2nd  order  correction,  and  the  1st  order  result,  equation  (C.17), 
is  also  applicable  to  the  more  general  case  of  a  range-dependent  sound  speed  profile. 


C.2.3  The  general  range-dependent  problem 
C.2.3.1  The  expansion  parameter 

A  perturbative  solution  to  the  range-dependent  problem  will  now  be  obtained  by  repeated 
applications  of  the  Foldy-Wouthuysen  transformation.  The  formal  expansion  parameter 
is  the  dimensionless  operator  A  =  A /k0 .  Note  that  if  we  take  the  operator 

A  =  V^/2 kl  -  fl ,  and  operate  it  on  eigensolutions  (i.e.,  modes),  we  find  that  the  first 
operator  W2T/2k{ 02  roughly  corresponds  to  the  square  of  the  grazing  angle  and  the  second 
—ju  measures  the  deviation  of  the  local  sound  speed  from  the  reference  value.  A 
sufficient,  but  strictly  speaking  not  necessary,  condition  for  the  operator  A  to  be  small  is 
that  the  grazing  angles  and  sound  speed  deviation  both  be  small.  (The  pathological  case 
where  both  are  large,  but  cancel  will  be  discussed  near  the  beginning  of  Appendix 
K.l.1.2.) 

The  range  dependence  is  assumed  to  be  small  and  slowly  varying,  and  so  the 
dimensionless  operator  k0 1  dA/dx  =  —k0  d/j/dx  is  formally  2nd  order.  In  general,  either 

the  operator  A  or  the  operator  k0_1  d/dx  contributes  an  order. 

Some  of  what  follows  can  also  be  found  in  [57],  but  the  full  calculation  is  reproduced 
here  for  completeness.  Looking  at  the  definitions  O  =  / and  £  =  Aij ,  we  see  that, 
unlike  for  the  calculation  in  the  reference,  these  operators  are  now  of  the  same  order,  and, 
even  on  a  fonnal  level,  some  care  must  be  taken  in  transferring  the  previous  results.  In 
particular,  when  the  calculation  is  carried  through  4th  order,  it  will  be  necessary  to  add 
tenns  which  did  not  appear  previously. 
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Before  proceeding  with  the  calculation,  let  us  examine  the  source  of  these  differences. 
Consider  the  expansion  parameter 


A  = 


-ll. 


For  comparison  to  the  quantum  problem,  make  the  usual  identification  of  the  transverse 
derivative  iVr  with  the  momentum  p  ,  k0ju  with  a  scalar  potential  </> ,  k0  with  the  mass 


m  ,  and  explicitly  display  the  speed  of  light  c  which  was  set  equal  to  unity  in  the  notation 
of  [57]  (as  was  fi ).  The  result  is 


With  p  ~  mv  +  O  ( v/c) 


(note  that  v  is  the  velocity  of  the  particle),  the  expansion 


parameter  becomes 


A  — >  - 


2c 2 


■  + 


me 2 J 


This  would  mean  that  the  expansion  is  in  v/c  and  in  the  ratio  of  the  potential  energy  to 
the  rest  energy  of  the  particle.  Thus,  following  the  practice  appropriate  for  the  acoustic 
calculation,  the  kinetic  and  potential  energies  should  be  placed  on  an  equal  footing. 

While  such  an  expansion  would  be  very  reasonable  from  a  formal  relativistic  point  of 
view,  it  is  not  followed  in  many  practical  quantum  mechanical  applications.  Of  the 
calculations  perfonned  in  reference  [57],  the  one  most  closely  analogous  to  the  acoustic 
problem  (because  it  is  spinless;  i.e.,  has  no  non-physical  degrees  of  freedom)  is  given  in 
chapter  9.  However  in  that  particular  calculation  (j)  is  not  small  compared  to  the  rest 
energy,  and  it  is  therefore  not  treated  as  an  expansion  parameter.  On  the  other  hand  for 
the  calculation  involving  the  hydrogen  atom  (chapter  4),  (j)  is  considered  to  be  of  smaller 
order  than  the  kinetic  energy  term. 


C.2.3.2  The  iterative  procedure 


To  generate  the  expansion,  a  succession  of  transfonnations  is  used,  each  having  the  form 
<f>  =  e's  <f> .  With  each  iteration,  S  is  of  increasing  order  in  A  ,  and  the  transformation  is 
designed  to  diagonalize  77  to  that  order.  Henceforth,  Roman  numeral  superscripts  will 
denote  the  order  of  S  and  of  the  diagonalized  part  of  77  .  The  corresponding  <f> ,  O  and 


£  will  also  be  labeled  in  this  way.  For  example,  <f)/ 


e  <f>  obeys  the  equation 


dx 


H1®1 


(C.18) 
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where  H1  =  k0ri  +  8‘  +01  to  0  (A) . 


m  m  _ *rr  /  ~  j  t  t  t  <  ' 

Substituting  e  <f>  for  <f>  in  the  original  equation  of  motion  (C.7),  we  have  the  result 


0  =  e 


iS1 


f 


n-i— 

V  ox  J 


e  iS  <f>7 . 


(C.19) 


From  a  theorem  given  in  reference  [57]  (p.  49),  and  using  7i  =  0  (A0 )  and  S1  =  O  (A) . 
we  have 


iS  nj  —iS 


er  He 


H - 

4! 


,  (C.20) 


and,  also  using  ^S'/,(3/dx)]  =  -S1  =  0(A2) , 

—e-iS‘  =  — 
dx  dx  2 


eiS'  ^-e-iS‘  =^--7A/+^-[1S/,1S/]  +  ^[1S/,[1S/,1S/]]  +  0(A5).  (C.21) 


These  results  are  substituted  into  (C.19)  and  the  expression  for  H1  is  obtained  by 
comparison  to  (C.18).  (Recall  that  the  commutator  [,]  is  defined  in  footnote  f.) 


By  analogy  to  the  range-independent  calculation,  choose 

irjO 


S1  = 


2  L 


(C.22) 


and  evaluate  the  expression  for  H1  using  rjO  =  - Or) ,  i]£  =  £ij ,  rj2  =  1 ,  and  equation 
(C.8). 

Since  8 /k0,O / k0  =0  (A) ,  the  anticommutation  of  O  and  rj  implies  that 
i [S'7 , H ]  =  —O  +  O  (A2 ) .  All  other  new  terms  such  as  those  proportional  to 

[V,[S',H]]  or  S1  are  second  order  or  higher.  Thus,  by  construction,  the  Foldy- 

Wouthuysen  transfonnation  cancels  the  odd  1st  order  tenns,  and  replaces  them  with  new 
tenns  of  higher  order. 

Note  that  this  result  depends  only  on  the  definition  of  S  and  the  anticommutation 
relation  {r],0}  -  0 .  The  procedure  can  be  repeated  order-by-order,  with  the  order  of  the 

remaining  odd  terms  increasing  with  each  iteration.  As  can  be  seen  from  the  following 
argument,  the  anticommutation  relation  will  always  be  met. 


Beginning  with  the  Hamiltonian  H  and  applying  repeated  transformations  of  the  sort  just 
discussed,  the  transformed  Hamiltonian  will  consist  of  terms  whose  matrix  part  is 
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constructed  from  products  of  77 's  and  £'s.  Since  £2”  =  (-1)”  1 ,  rfn  =  1  and  {£,77}  =  0 , 
the  matrix  part  of  the  odd  operators  will  be  either  ij^  or  %  while  that  of  the  even 
operators  77  or  1 .  It  follows  that  the  odd  operators  all  anticommute  with  77 . 

The  anticommutation  relation,  together  with  the  iterative  nature  of  the  procedure  also 
insures  that  energy  will  be  conserved  no  matter  how  many  times  the  procedure  is 
repeated.  Before  iteration,  'H  and  therefore  O  arc  pscudo-Hermitian  (0"H-  'HO  ), 
which,  using  the  anticommutation  relation,  also  implies  that  S  =  -  irjO / 2k0  is  pseudo- 

Hermitian.  Thus,  elS  is  pseudo-unitary  and  the  transformation  preserves  energy 
conservation.  The  new  Hamiltonian  and  consequently  the  new  O  are  therefore  once 
again  pseudo-Hermitian,  and  the  process  can  start  all  over  again. 

In  this  way,  we  have  provided  an  order-by-order  prescription  for  diagonalizing  the 
Hamiltonian.  This  implies  that  the  “true”  coupling  between  forward  and  backward 
modes  (i.e.  backscatter)  is  of  higher  order  than  any  finite  order  of  perturbation  theory, 
and  that  the  perturbative  expansion  constructed  in  this  way  must  be  asymptotic.  We  will 
return  to  this  issue  later. 


C.2.4  Implementing  the  procedure 


On  page  50  of  [57],  the  terms  needed  to  implement  the  first  Foldy-Wouthuysen 
transformation  are  evaluated.  The  results  are  included  here  for  completeness.  Note  that 

any  odd  operator  to  an  even  power  is  even.  Tenns  of  0(A'j  and  higher  are  dropped. 


Recall  that  S1  is  given  by  equation  (C.22).  The  first  iteration  of  the  Foldy-Wouthuysen 
procedure  yields  the  Hamiltonian  H1 .  From  equations  (C.  19)  to  (C.21),  H1  is  given  by 
the  original  matrix  Hamiltonian  TL  followed  by  a  sum  over  the  following  tenns: 


i[sJ,n] 

y,[s7,w]] 


=  -0+ 


r/O2 

2  K 


JL 

2  K 


8  k: 


[0,£]  +  ^-r/02 

K 


O3  TjO4 

~6ki  ~6k[ 
r/O 4 
24  k3 


iijO 

2  K 


8*0 


0,6 
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To  the  order  that  we  are  concerned  with,  it  would  at  first  appear  that  two  terms  are 
missing: 

in  [>',«]]] 

and 

Vis'll  =  — l^\o,\o,6\ 

6L  ’L  JJ  24£0  L  ’L  JJ 

However,  these  tenns  are  4th  order  and  odd.  The  next  Foldy-Wouthuysen  transfonnation 
will  remove  these  odd  terms,  replacing  them  with  terms  of  5th  order  and  higher,  which  is 
of  higher  order  than  concerns  us  here.  Therefore,  to  obtain  the  diagonalized  Hamiltonian 

to  0(A4)  ,  we  can  use  the  results  from  [57]  without  modification.  However,  we  have  to 

be  careful  to  make  sure  we  include  all  relevant  terms  during  the  next  stages  of  the 
diagonalization  procedure.  As  mentioned  already,  the  order  counting  in  the  reference  is 
different  from  that  which  must  be  used  for  the  parabolic  equation. 


Combining  the  terms  calculated  above,  we  have  the  result 

n1  =  k0Ti+sI+o,+ o(a5) 


o2  _  o 4 
2  k0  8  kl 


+f“^[°N>4]-A[o,o]  (C23) 


o'  =  £[o.e]-£+3r= o(a’). 


2  K  2*0 


So  far,  these  results  are  the  same  as  for  the  relativistic  reduction  problem  in  quantum 
mechanics. 

Now,  the  procedure  is  iterated  to  eliminate  odd  terms,  which  are  2nd  order  in  A  .  The 
second  Foldy-Wouthuysen  transfonnation  is: 

S"  =  -^  =  0(A2) 

2k Q 

H“  =  W'+i[s",W']-i[s",[s",W']]-S"+0(A5)  (C.24) 

=  k^  +  £‘  +i[s“  ,£‘]+'-[s“  ,0']+’^-  +  0(A5), 

where  the  result  =  iO1  was  used  to  simplify  H 11 .  Simplifying  the 

commutators,  we  now  have 
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(C.25) 


Hn  =  k0Ji  +  £n+On+  0(A5) 

(o1)2 

£"  =  £'  +TJ- — — 

2k0 

o"  =  JL[oi,£i]+1?°-=o(A3) 

2  k0  2k0 

Note  that  the  second  term  in  £u  is  new;  this  tenn  did  not  appear  in  the  calculation  of 
reference  [57]  (see  paragraph  below  equation  (C.29)).  Now,  we  apply  the  Foldy- 
Wouthuysen  transformation  yet  again 

sui  =_^  =  0(A3)  (C.26) 

2k0 


and  obtain 

nm  =  k07]  +  £n  +  i  [s'" ,  £u  ]  -  SIU  +  O  ( A5 ) . 


(C.27) 


The  last  two  terms  of  7 t  are  4th  order  odd  terms.  Once  again,  note  that  the  only  effect 
of  the  fourth  Foldy-Wouthuysen  transformation  will  be  to  push  such  odd  4th  order  terms 
up  to  higher  order,  so  we  finally  have 


KIV  =k0n  +  £u , 


(C.28) 


where 


V 


o-  O 


4  A 


gii  _  \2k0  8 k0  j 


+  £  — 


8k; 


-[0,[0,£]]~ 


8^L 


0,0 


+ 


j]_ 

8  kl 


(-[0,£f -i{0,[0,£]}  +  02)+  5th  order 


(C.29) 


(This  recovers  equation  (3.10).)  Recall  that  {,}  denotes  the  anticommutator.  The  last 
tenn  is  the  expansion  of  tj  [O1 )  j2k{) . 

Note  that  even  on  this  formal  level,  we  now  have  extra  4th  order  terms,  which  do  not 
appear  in  the  results  of  [57].  This  is  because  powers  of  the  expansion  need  to  be 
tabulated  differently  in  that  problem.  On  the  other  hand,  note  that  this  is  the  only 
difference  so  far  between  the  quantum  mechanical  and  acoustic  problems.  Indeed,  the 
only  assumption  that  went  into  obtaining  equation  (C.29)  was  that  the  Hamiltonian  77 
have  the  basic  form  (C.7),  and  that  its  constituents  O  and  £  obey  the  commutation 
relations  described  at  the  end  of  Subsection  C.2.3.2  above.  Equation  (C.29)  is  therefore 
quite  general,  and  will  apply  to  the  acoustic  problem  where  the  density  varies,  and  even 
to  vector  fields  (e.g.,  electromagnetic  and  elastodynamic  fields). 

Now  substitute  £  =  Xtj  and  0  =  into  (C.29).  This  gives 
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(C.30) 


[0,£]2=4A4 
-i{0,[0,£]}  =  2t][A2,A 
62  =  -A2 


Combining  results,  we  have 


f 


n'v  =  T]k0 


A  A2  A3  5  A4  A 


2  \ 


1  + - v+  , 

L  k:  2k: 


+  ■ 


V  ,vo 
i 


8  k4 


Kj 


8k, 


o 


[a,A]--^[a2,A]+ o(as) 


=  7jk0 


,  2/1 
1  +  — 


A 


4k( 

2  A 


(C.31) 


8klj 


+  ■ 


r  A, A] — ^r/lUl  +  0(A5) 

L  J  4k  1  J  y  1 


where  recall  A  =  V2  / 2k 0  -  jilkn  and  there  is  an  implicit  unit  matrix  1  in  the  last  two 
tenns.  In  the  last  line,  we  have  formally  replaced  the  terms  I  +  yk  -  :  ^  -  sxy%k 4 

with  ■sJ\  +  2A/k0  ,  as  always  following  the  common  practice  of  defining  the  function  of  an 
operator  by  the  Taylor-Series  expansion  (the  classic  example  of  this  being  identification 
of  the  formal  expression  exp  (a  with  translation  by  a  ).  As  was  assumed  in  deriving 
equation  (C.31)  and  everywhere  else  this  paper,  this  implicitly  assumes  that  the  operator 
2A/k0  is  small  in  some  sense. 


Note  that  in  the  limit  A—>  0  (i.e.,  no  range  dependence)  we  recover  the  expansion  of 
H  =  rjk0-sjl  +  2A/k0 ,  which  is  indeed  the  result  for  a  range-independent  sound  speed 
profile.  Also  note  that  if  there  is  no  transverse  dependence  in  fi ,  then  the  commutator 
terms  disappear  as  well.  Thus  the  commutator  terms  are  associated  with  the  simultaneous 
presence  of  both  range  and  transverse  dependence. 


For  the  diagonal  Hamiltonian  ki  ,  the  pure  forward  and  backward  propagating  solutions, 

respectively , 


f  0^ 

rr 

cc 

llj 

and  ‘h 

,0/ 

remain  of  the  same  vector  form  as  they  propagate.  For  these  solutions,  ij  can  be  replaced 
by  +1 ,  so  that  the  energy  flux  is  proportional  to  ch  ch .  Following  the  same  reasoning  as 
before,  it  follows  that  the  range  propagation  operator  must  be  unitary  and  ? i  must  be 
Hermitian.  Being  diagonal  and  Hermitian,  it  remains  pseudo-Hennitian  as  well.  Using 
the  expression  for  Hn  given  in  equation  (C.3 1),  Hermiticity  can  also  be  verified  directly 
by  inspection.  Note  that  the  forward  and  backward  propagating  solutions  independently 
conserve  energy,  the  perturbative  formalism  making  no  allowance  for  energy  loss  due  to 
backscatter. 
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The  explicit  appearance  of  the  complex  number  i  as  a  coefficient  in  equation  (C.3 1) 
creates  the  illusion  that  the  propagation  may  be  diffusive.  However,  note  the  term  is 
actually  Hennitian,  and  so  overall  energy  is  conserved,  and  there  is  be  no  diffusion  here. 
It  will  shortly  be  shown  that  it  is  possible  to  further  transform  the  result  (C.3 1),  among 
other  things,  removing  these  complex  coefficients. 


C.2.5  Manifestly  range-reciprocal  form 


It  is  possible  to  apply  a  variation  of  the  Foldy-Wouthuysen  transfonnation  to  rearrange 
tenns  without  changing  the  order  of  the  diagonalized  part  of  the  Hamiltonian.  For  the 
Hamiltonian  (C.31),  choose 


sir.i  _  V  dA  _  JjA 
8 k2  dx  8 k2 


0(A=). 


(C.32) 


(There  is  here  a  slight  labeling  change  here.  The  Roman  numeral  on  S  still  corresponds 
to  the  order  to  which  the  Hamiltonian  is  diagonal,  but  not  to  that  of  S  itself  any  more. 
The  “1”  after  the  Roman  numeral  identifies  Sn  1  as  the  generator  of  the  first  order¬ 
preserving  variant  of  the  Foldy-Wouthuysen  transformation.)  Note  that  Sn  A  is  diagonal 

and  Hennitian,  so  the  transfonnation  e‘s  is  unitary,  diagonal  and  pseudo-unitary,  so 
energy  conservation  considerations  discussed  in  the  previous  section  will  not  be  affected 
by  the  transfonnation. 


Now,  we  have 
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8 k0  8kL0  j 


3/ 


16  k: 


■[A2,i]  +  0(A5)  (C.33) 


A 


kg  8  kg  8  kg  J 


3/ 


16  K 


■[A2,i]  +  0(A5). 


Perform  a  second  such  transformation; 


iiv.  2 


16  kg  dx  16  kg 


where  recall  j/l,  AJ-  AA  +  AA  .  Now, 


(C.34) 
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n,vl 


=  H1VA  +  i  [5/f'2 , 7  (k0  +  X)] -  SIV2  +  O  ( A5 ) 


=  vK 

=  vK 


A  A2  A3 

1  + - ^  +  — r 

kn  kl  2  kl 
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(C.35) 


Equation  (C.35)  is  the  most  important  result  of  this  appendix  and  it  corresponds  to 
equation  (3. 1 1)  in  the  main  body  of  the  text.  Once  again,  note  that  the  square  root 
operator  above  is  essentially  a  placeholder  for  its  Taylor  Series  expansion.  The 
distinction  is  important,  when  the  expansion  parameter  2A/k0  >  1 ,  but  this  is  outside  the 
purview  of  this  work  (see  the  discussion  in  the  third-to-last  paragraph  of  Appendix  B). 


Thus,  7i  =  /;//  where 


and  so 


H  =  k. 


2A  A 


1  K 

i4> 

=  J]II 

2  6' 

U/ 

kXj 

(C.36) 


For  one-way  (downrange)  propagation,  0  =  0  and  dropping  the  tilde  on  X ,  wc  have 

~iX  =  HX- 

Note  that  the  new  terms  manifestly  exhibit  range  reciprocity.  This  means  that  the 
substitution  x  — >  —x  will  not  change  the  fonn  of  the  scalar  Hamiltonian  at  all,  and  in  the 
overall  matrix  equation  it  only  flips  the  uprange  and  downrange  labels.  The  original 
wave  equation  obeys  this  symmetry,  and  it  is  good  to  get  a  form  of  the  PE  that  also 
manifestly  maintains  this  symmetry.  For  this  reason,  the  fonn  given  in  equation  (C.36) 
(and  its  higher-order  versions)  above  will  always  be  used  in  preference  to  the  scalar 
version  of  the  form  given  in  equation  (C.3 1). 

Recall  that  modem  wide  angle  PE's  based  on  the  so-called  Pade  approximation  do  not 
necessarily  reproduce  the  series  expansion  in  the  appropriate  limit  (see  footnote  m),  and 
so  adaptation  of  results  such  as  equation  (C.36)  to  these  approaches  will  be  non-trivial. 

It  can  also  be  shown  that,  to  the  order  obtained  here,  the  result  is  independent  of  the 
choice  of  k0 . 
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Also  note  that  to  3rd  order,  we  have  the  straightforward  substitution  //  — >  //  —  /// (8k2 ) 

(recall  that  A  =  V2  / 2k 0  -  k0ju  and  jil  =  d2/j/dx2  refers  to  the  second  derivative  with 
respect  to  the  range).  To  this  order,  this  is  equivalent  to  rescaling  the  sound  speed 
c  — >  c  —  cj (8 k2 ) . 

The  results  may  seem  somewhat  surprising  at  first.  For  example,  the  transformation 
Sn  1 ,  roughly  speaking,  substitutes  a  dependence  on  3  (v2//)  jc)x  for  one  on 

jil  =  32///3x2  .  These  terms  can  be  zero  at  different  times,  but  this  apparent  discrepancy 
disappears  after  closer  scrutiny.  If  jil  =  0  ,  then  [//,/'/]  ^  d /a/ dx,  and  the  commutator 

tenns  simply  contribute  to  the  transformation  at  the  endpoints.  The  contribution  is  only  a 
phase  change,  since  the  tenn  is  Hermitian.  In  fact,  since  we  always  assume  the  range 
dependence  is  locally  zero  at  the  endpoints,  this  contribution  is  zero,  and  the  commutator 
tenns  do  not  contribute  at  all  to  the  final  answer.  Now,  lets  consider  the  opposite 

scenario:  if  [A.AJ  =  0 ,  then  jil  =  d/d/dx  and  jil  is  a  perfect  derivative,  and  again  we  are 

left  with  a  trivial  endpoint  (phase)  change,  which  goes  away  entirely  if  we  neglect  the 
range  dependence  at  the  endpoints. 

In  this  way,  we  can  begin  to  see  how  these  two  very  different  representations  of  the  same 
problem  still  lead  to  the  same  solution.  The  notion  that  very  different  looking 
Hamiltonians  yield  the  same  range  propagation  results  is  not  unique  to  this  problem. 
Canonical  transfonnations  often  have  this  attribute.  This  is  most  easily  seen  when  the 
problem  is  formulated  in  terms  of  the  Lagrangian.  The  equations  of  motion  are  not 
changed  by  the  addition  of  a  total  time  derivative  of  a  function  of  the  coordinates  and  the 
time  ( df  (q,  t)/dt ).  However,  after  returning  to  the  Hamiltonian  picture,  the  connection 
between  the  corresponding  Hamiltonians  may  be  quite  obscure. 


C.2.6  A  physically  intuitive  picture  of  the  results 


We  are  now  in  a  position  to  develop  physical  interpretations  of  the  Foldy-Wouthuysen 
procedure.  The  transformations  used  to  diagonalize  the  Hamiltonian  seek  out  the  effects 
which  accumulate  during  propagation,  and  separate  them  from  effects  which  cancel 
everywhere  except  at  the  endpoints.  The  former  come  from  “virtual”  oscillations 
between  forward  and  backward  propagating  modes.  This  concept  can  be  developed  as 
follows.  Recall  the  prescription  //  — >  //  -  jl /(8k02 ) .  Consider  a  coordinate  shift 

x  —>  x  +  Sx  and  expand  //  (x  +  Sx) .  We  have 


fj(x  +  Sx)  =  //(x)  +  jii{x)  5x  +  ^-//(x)  Sx1 


and  treating  Sx  as  a  random  fluctuation  to  be  averaged,  we  have 
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[ju  (x  +  Sx ))  =  //  (x)  +  jU  (x)  [Sx)  +  —ju  (x)  (Sxz 


Comparing  the  prescription  //  — >  //  -  juj(%kl ) ,  we  conclude  that  the  fluctuation  has 
average  [Sx)  =  0  (this  is  perfectly  reasonable;  most  random  fluctuations  have  this 
property)  and  root  mean  square  displacement 


Note  that  this  corresponds  to  fluctuations  on  the  order  Sx  ~ij (2 k0 ) .  The  quantity  is 

imaginary  because  it  is  associated  with  an  evanescent  wave  phenomenon.  As  alluded  to 
in  Section  2.1,  the  lst-order  boundary  term  is  sometimes  moved  from  the  endpoint 
transformation  into  the  Hamiltonian,  where  it  becomes  //i/2  (e.g.,  see  Schurman  et  al. 19, 
and  also  Appendix  B  of  reference  [1]).  It  was  pointed  out  in  Section  2.1  that  this  is  not 
usually  a  good  practice,  but  here  it  is  instructive.  Including  both  the  new  Foldy- 
Wouhuysen  term  and  this  tenn  in  the  Hamlitonian,  we  are  left  with  the  simple 
prescription  //(x)  — >  fl(x  +  i/2k0) ;  i.e.,  a  small  shift  of  x  into  the  complex  plane. 

Note  that  when  averaged  over  any  scale  resolvable  by  the  wavelengths  of  the  field 
involved,  no  energy  flows  from  the  forward  into  the  backward  propagating  modes  (or 
vice  versa).  This  is  what  is  meant  by  a  virtual  oscillation  between  the  modes.  Higher- 
order  terms  correspond  to  virtual  fluctuations  into  multiple  backward  propagating  modes. 
(This  can  be  deduced  by  once  again  appealing  to  the  analogy  to  quantum  field  theory. 

The  Feynman  diagram  picture  identifies  the  order  of  a  term  in  the  perturbative  expansion 
with  the  number  of  virtual  particles  [or  modes]  created.)  The  diagonalization  procedure 
also  serves  to  push  “true”  (as  opposed  to  virtual)  backscatter  to  its  “natural”  place  at 
infinite  order  in  perturbation  theory. 

The  oscillations  between  forward  and  backward  propagating  modes  have  the  attributes  of 
a  hannonic  oscillator.  The  transformations  used  to  simplify  the  diagonal  representations 
of  the  Hamiltonian  can  be  understood  as  translations  of  the  zero  point  of  the  harmonic 
oscillator.  The  crucial  quantity  characterizing  both  the  virtual  fluctuations  and  the  “true” 
backscatter  is  ju ,  which  corresponds  to  the  spring  constant  of  the  oscillator.  For  “true” 
backscatter,  the  oscillator  is  not  bound,  and  the  spring  constant  is  negative.  Once  again, 
we  are  reminded  that  “true”  backscatter  is  not  a  process  that  lends  itself  to  this  kind  of  a 
perturbative  approach.  For  virtual  oscillations,  on  the  other  hand,  the  analogous 
oscillator  is  bound,  the  spring  constant  positive,  small  fluctuations  remain  small,  and 
perturbation  theory  is  appropriate. 
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D  Appendix:  The  transformation  between  the  pressure 
and  the  PE  field 

The  goal  here  is  to  establish  the  relationship  between  the  auxiliary  field  X  that  propagates 
according  to  the  PE  (equation  (3.13);  in  this  appendix  we  put  tildes  on  x  and  H  to 
clearly  distinguish  between  the  ansatz  (no  tildes)  and  the  transformed  problem  where  the 
matrix  Hamiltonian  is  diagonal  (tildes)),  and  the  actual  pressure  field  A  that  propagates 
according  to  the  Helmholtz  equation  (equation  (2.2)). 


D.1  The  transformation  when  the  density  is  everywhere  the 
same 

From  the  Foldy-Wouthuysen  ansatz  (equation  (3.2);  the  definition  of  the  2-dimensional 
vector  <£> )  we  have  immediately  the  equation 

(1  l)<F  =  zi.  (D.l) 

We  also  need  to  be  able  to  go  from  the  2-dimensional  field  vector  that  propagates 
according  to  the  diagonalized  Hamiltonian  <E>  back  to  the  ansatz  vector  <T> .  Recall  from 
reference  [1]  (or  equivalently  Appendix  C.2.3.2)  that  the  Foldy-Wouthuysen  procedure  is 
iterative.  If  obeys  a  matrix  equation  of  motion  that  is  diagonalized  to  order  n  - 1 , 
then  one  more  Foldy-Wouthuysen  transfonnation 

4>.  :  (D.2) 

will  produce  a  field  <f>(,  that  propagates  according  to  a  matrix  equation  of  motion  that  is 
diagonalized  to  order  n  .  So,  we  can  write  in  general 

<f>  =  e~iS' e~iSn  e~iS,n  •  ••  <f> 


and 

A  =  (  1  1 )  <T>  =  ( I  \)e~iSle~iSne~iSm  •••<!>.  (D.3) 

The  Foldy-Wouthuysen  (FW)  procedure  precisely  determines  Sf ,  SH ,  Sm  •  •  • ,  and  we  can 

substitute  for  these  operators  to  calculate  product  e~'Sl  e~,s"  e~'s'"  •  •  •  order  by  order.  (Note 
that  in  this  context,  Roman  numerals  enumerate  successive  FW  transformations,  and  do 
not  refer  to  sides  of  an  interface.)  In  Appendix  D.2,  we  will  employ  precisely  this 
strategy  to  obtain  the  x  A  transformation  good  to  0(  A2, Ay)  for  the  case  when 
y  =  (p0  -  p)/2p0  is  non-zero  (i.e.,  the  density  is  not  uniformly  constant).  However,  here 

where  the  density  is  held  uniformly  constant,  let  us  do  something  a  little  different.  If  we 
assume  that  there  is  no  local  range  dependence  (e.g.,  those  extra  Lamb-shift  terms  are  all 
zero),  we  can  take  advantage  of  the  fact  that  we  already  know  how  to  calculate 
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e~tS‘  e~lS"  e~,Sl"  •••to  °°  -order.  To  do  so,  we  follow  Section  II.B  in  reference  [1],  Section 
[9.7]  in  reference  [57],  and  the  treatment  above  in  Appendix  C.2.2.  Rather  than  iterative 
infinite  series,  only  a  single  Foldy-Wouthuysen  transfonnation  is  needed  to  fully 
diagonalize  the  matrix  Hamiltonian  77  .  In  particular,  we  have 


,  5$ 


2/1 


i^=Kv.  i+— ' 

ox  \  kn 


(D.4) 


where  (see,  for  example,  equation  25  in  reference  [1]  or  equation  (C.13)  above) 

<f>  =  eiS$> 


S  = 


(p  =  tanh  1 


"0 

n 

U 

0/ 

U  J 

<P 


(D.5) 
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71  +  kn 


The  equations  have  been  slightly  recast  into  a  fonn  more  conducive  to  the  calculations  to 
be  pursued  below. 


Thus  we  have 


0  1 

=  eiS<$>  =  eyi  oj2<I> 


and  we  need  to  evaluate  the  exponential.  Using  the  symbolic  manipulation  package 
Maple™,  we  have 


o  1 
1  0l2 


f  V/  -9/  9/  -V/  \ 

eA  +  e  /2  e/2  -  e  A 


2 

£A-e 


2 

+e~ 


or 


l  2 

2  J 

i 

'  l  +  e* 

-1  +  e^ 

2V7 

v-l  +  ^ 

1  +  ef  y 

Thus,  we  have 


$ 


'S' 


■  =  eiS 


'e\ 

27/ 


(  l  +  eip  -1  +  e^ 
y-l  +  e*  X  +  e*  j 


.27/ 


and  multiplying  by  (1  l),  we  get 


Q  +  X  =  -L=(2e(p  2e*)  \  =  ^(6>  +  Z). 

v27y 
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D.2  The  transformation  when  the  density  is  only  constant  in  the 
half-space 

In  this  case,  we  are  only  interested  in  the  answer  to  second  order  (i.e.,  0(  A2,  Ay)).  Thus 
we  begin  with  equation  (D.3)  adjusted  to  this  requirement: 
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A  =  (  1  l)e“"V's"<J> 


(P-9) 


and  calculate  S,  and  Sn  to  the  required  order. 


The  Foldy-Wouthuysen  procedure  (see  reference  [1],  equation  (35);  or  equivalently 
equation  (C.22)  above)  gives  us 

_  _-irjO  _-i{r}{)A_-a(0  O 


2  K 


2kn 


2  L 


V1  0y 


(D.10) 


and  so 


(D.l  1) 


Substituting  equation  (39c)  from  reference  [1]  (or  equivalently  the  third  part  of  equation 
(C.23)  above)  into  equation  (40a)  from  reference  [1]  (or  equivalently  the  first  part  of 
equation  (C.24)  above),  and  dropping  the  tenn  that  goes  away  if  there  is  no  local  range 
dependence  as  well  as  the  term  proportional  to  O3 ,  which  is  higher  order  than  concerns 
us  here,  we  have 


_  -irjO 

~  IK 


-jy  v 
2 K  2K 
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\_A4,{A-2kj)rf\ 
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(D.12) 


and 


e~iS“  =  e^2k° 


f[2i2-2i0{ni}](  l 


0  1 


Expanding  the  exponentials  to  0(  A2,  Ay),  we  have 

e-iS,e-iSn  _1  + 


a  +  a2  { r Y} 


2  k0  2  k0  2  k0 


Yo 
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+  -• 

/I2 
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4  K 
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(D.13) 


0(A\yA2,fA),(D.l4 ) 


and  (using  equation  (C.8)) 

A  =  (  1  l)e“‘ViS"4>: 


,  2  ( il-  {y.l} 

2  k0  8^  2  k0 


(1  1) 

'o' 

X 

(D.l  5) 


199 


Only  now  do  we  need  to  invoke  the  fact  that  the  density  is  locally  constant  in  the  half¬ 
spaces  away  from  the  interface  so  that  {y,  Aj  =  2yA  .  Recalling  from  equation  (5.13)  that 

a  =  pj  p  =  \  +  2y  +  0(y2)  we  have  to  0(A2,A  y) 


A  =  \  1 


5(aA)2 


+  •••  (0  +  z)  =  \  1  + 


(0  +  z)- 


(D-16) 


and  setting  uprange  propagation  6  to  zero,  we  have  indeed  verified  to  0(  A\Ay)  that 


A  =  Z- 


(D.17) 


E  Appendix:  A  close  look  at  volume  fluctuations  and 
the  PE 

This  appendix  closely  examines  volume  fluctuations  where  the  range  dependence  is 
generated  by  distorting  the  range-independent  sound  speed  function  [//]  (z)  by  an 

arbitrary  function  of  the  range  and  depth  /  ( x,z ) vvv.  This  way  of  imposing  range 
dependence  is  crudely  illustrated  in  Figure  E.l. 


vvv  Subject  to  the  restriction  that  the  range-derivatives  of  f  are  small.  This  restriction  comes  explicitly 
from  the  range-derivatives  in  the  new  terms  generated  from  the  Foldy-Wouthysen  transformation  (such  as 
—  A/$/(q  ),  and  implicitly  from  the  well-known  PE  requirement  that  fields  are  modestly  inclined  from  the 
horizontal. 
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How  to  construct  range-dependence  for  the  stochastic  PE 


% 


Lines  of  constant 

ME)  (i  e., 

contour  lines  or 
equipotentials) 
are  range 
independent 


Distort  the  contour  lines  to 
create  range  dependence 


Contour  lines  for 
p (x,z)  are  now  a 
series  of  rough 
surfaces  of  the 
form  f(x,z0). 


Figure  E.l  -  Range-dependent  volume  fluctuations  of  the  sound  speed  function  /l  =  [\  —  c\j  C^jT. 
are  generated  by  distorting  contour  lines  in  the  range-independent  problem. 


The  contour  lines  are  now  quasi-planar  rough  surfaces,  and  so  the  stochastic  problem  is 

obtained  by  range  averaging  functions  of  /  (i.e.,  ^/2^(z)  =  \  j  f2  (x,z)dx ,  etc.).  The 

parabolic  equation  implicitly  assumes  that  the  range-dependent  problem  is  derived  from 
some  range-independent  problem  by  a  distortion  of  this  general  fonn.  This  insight 
becomes  crucial  when  we  consider  volume  fluctuations  adjacent  to  cusps  and 
discontinuities.  We  will  see  below  that  this  way  of  generating  stochastic  range 
dependence  allows  the  averaging  process  used  within  the  volume  to  dip  down  into  the 
nooks  and  crannies  adjacent  to  a  rough  line  along  which  the  sound  speed  has  a  cusp  or  a 
discontinuity.  Thus,  the  stochastic  results  derived  in  this  appendix  will  still  apply 
everywhere  except  right  on  top  of  the  singularity. 

The  basic  geometry  of  a  distorted  contour  line  is  shown  in  Figure  E.2.  The  illustrated 
contour  line  was  located  at  a  depth  z0  in  the  original  undistorted  problem.  After 
distortion,  we  have 
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V//  = 

t  = 


The  slope  of  t  (i.e.,  the  slope  of  the  tangent  to  the  contour  line)  is  given  by 


djU  dju ' 

v  3x  3z  y 
/  d/J  djU ' 

v  dz  ’  dx  , 


or 


3// =  3// 

dx  dz 


(E.l) 


where  a(x,z)  =  —f  (x,z0)  is  the  negative  of  the  slope  of  the  tangent  to  the  contour  line 
passing  through  that  point.  It  is  a  small  random  function  of  x ,  and  it  depends  on  z  too. 


The  basic  geometry  of  a  contour  line 


Recall  that  in  this  appendix,  we  are  concerned  with  the  new  term  -  Xj  Ski  =  ■  Let 

us  drop  the  factor  J/U2  and  concentrate  on  ju  . 


The  starting  point  is  formula  (E.  1).  Taking  the  partial  derivative  with  respect  to  the  range 
x  gives  us 


3 2 /i  _  3  f  dju^ 
dx2  dx  ^  3z  j 


.  da 

a—  +  a 
dz 


3 2  n 
3x3  z 


Now  use 


d2/j  _  3 2jU  _  3  f 
3x3 z  dzdx  dz  v  3 zy 


3  a  dju  3  2jU 

dz  dz  dz2 
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to  get 


d2jU  _  .  djU  2  d2jU  da  d/U 


dx2  adz+a  dz2 


+  a- 


dz  dz 


(E.2) 


Result  (E.2)  is  quite  general.  In  particular,  along  the  line  z  =  z0 ,  we  have 

^,Zo)  =  a(x,zo)g^W(*,z„)g2tE f2  +  g(x,Zo)3«Ezo)^(x.z,)i(E3) 


dzn 


dz 


dz„ 


dz„ 


Now,  let  us  use  smoothness.  From  the  geometry  given  in  Figure  E.2,  we  have 
//(x,z0  +  /(x,z0))  =  [ //  ]  ( z0 ) .  Performing  a  Taylor  series  expansion,  this  gives  us 

d/j(x,z)  I 


//(x,z0)  =  [//](z0)-/(x,z0)- 


dz 


+ 


o(/2)- 


(E.4) 


Truncating  at  2nd  order  in  /  and  its  derivatives  (including  a ),  we  see  immediately  that 
we  can  substitute  [//]  (z0)  for  //(x,z0)  in  the  second  and  third  terms  of  (E. 3).  Similarly, 

in  these  two  terms  we  can  slide  a(x,z0 )  over  to  a[x,z0  +  f  (x,z0))  =  —f  (x,z0) . 

However,  we  will  have  to  use  our  knowledge  of  the  distortion  geometry  to  evaluate 

d//(x,z0) 


a(x,z0)- 


dzn 


Taylor  series  expansion  gives  us 

~f(x,z0)  =  (x(x,z0  +  f(x,z0))  =  a(x,z0)  +  f(x,z0)da^Z0^+O(f3), 


and  taking  a  derivative 


dx 


dz 


dz 


or 


a(x,z0)  =  -f(x,z0)~  f  (x, z0 )  - / (x, z0 )  +  O (/3 ) .  (E.5) 


dz 


dz 


From  our  Taylor  series  expansions  above,  to  0(/3)  in  equation  (E.5)  we  can  again  slide 

a[x,z0 )  and  a(x,z0)  to  z0+/(x,z0)  where  they  become  -/(x,z0)  and  -/(x,z0) 
respectively.  This  leaves  us  with 


d  (x,  z0 )  =  -/  (x,  z0 )  +  /  (x,  z0 )  —  fe ^  +  /  (x,  z0 )  A  ^ 


dz 


dz 


+  0(/3).  (E.6) 


Finally,  let  us  take  d/d z0  of  equation  (E.4)  to  get 
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3//(x,z0)  3[//](z0)  3/(x,z0)  3//(x,z) 


3zA 


3z„ 


3z„ 


3z 


-f{x,z0) 


d2jll(x,z ) 


3z2 


+ 


0(/2).(E.7) 


From  equation  (E. 4),  we  see  to  first  order  in  /  we  can  replace  //(x,z0)  with  [//](z0)in 
equation  (E.7),  leaving  us  with 

d//(x,z0)  _3[//](z0)  3/(x,z0)3[//](z0)  _^32[//](z0)  |  q^2)  (Eg) 

3z,  ;u  ’  ;i"2  V  2 


3z, 


3z, 


3z, 


3z,; 


Thus  we  have 
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d2[/*](zo)' 


+  0(/3)  (E.9) 
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3zn 


+  /(,,zJ^M  +  /(,,zc)/(,,z0)TMfd  +  O(/). 


3zn 


3 zn 


3z" 


Now  we  substitute  this  back  into  equation  (E.3).  Recall  that  the  last  two  tenns  are 
already  2nd  order,  so  in  these  terms  we  are  free  to  replace  a(x,z0)  with  -/ (x,z0)  and 

ju  (x,z0 )  with  [ // ]  ( z0 ) .  This  gives  us: 

^vy«(v,)^yiv,)»+/(v/(v‘|aMh)+ 


3z0  v  u/  3z0 


3z0 


:-/(x,zjd^+/(x,z1)j^^^+/(z,z0)a/F’zd4/'](^.) 
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+/(x,z0)^3MN+/(x,z„)/(x,z„)TMN 
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,  72  ( „  „  [a](Zo)  ,  7f,„  „  \  df{x,Z  o)  3[//](z0)  f  ^ 

+  ./  (*>ZoJ - 2^ - +  ./(^Zo) - ~ - 2^ - +  0(y  j 


(E-10) 
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This  equation  holds  for  any  x  and  any  z  =  z0 .  Note  that  /  (x,z0)  is  the  distance  the 
point  (x,z0)  has  been  displaced  by  the  deformation  (note  the  displacement  is  by 
construction  vertical).  It  is  also  now  the  random  variable  characterizing  ju  (or  fi  for  that 
matter)  at  that  point.  Note  that  jit  now  contains  no  information  about  any  contour  line 
other  than  the  one  that  ran  along  the  line  z  =  z0  in  the  undistorted  problem.  Thus,  when 
we  shortly  average  /  to  find  (//)(z0) ,  we  will  not  be  slicing  along  contour  lines;  we  will 

simply  be  measuring  the  amount  the  original  contour  line  was  distorted.  In  other  words, 
we  have  effectively  collapsed  the  distorted  contour  line  back  down  to  the  original  line  at 

z  =  z0,  with  now  /,/,/,  etc.  giving  information  about  the  distortion.  For  volume 

fluctuations  near  a  rough  interface  along  which  our  smoothness  conditions  are  violated, 
this  method  of  averaging  indeed  continues  the  volume  problem  in  each  given  region 
down  to  where  the  undistorted  interface  used  to  be.  This  is  true  for  all  x  regardless  of 
where  the  interface  may  be  at  that  particular  moment.  At  the  rough  interface,  we  will 
proceed  in  a  similar  manner.  We  will  convert  the  S  -functions  to  effective  boundary 
conditions  along  the  wavy  interface,  and  then  slide  all  necessary  quantities  down  to  the 
undistorted  surface,  in  the  process  obtaining  effective  boundary  conditions  along  the 
flattened  undistorted  surface.  Again,  the  rough  interface  effectively  collapses  back  to  its 
original  position.  All  this  implies  that  our  results  for  a  fluctuating  volume  will  be  good 
right  on  down  to  the  flattened  interface,  which  then  carries  its  own  effective  boundary 
conditions.  It  is  all  fully  consistent,  and  we  are  now  ready  to  consider  the  stochastic 
problem  (which  will  be  effectively  range-independent). 

When  we  take  the  stochastic  average,  we  immediately  use  (f^j  =  0  and 

(/  (x,z0 )  /  (x, z0 ))  +  (/2  (x, . z0 ))  =  0 


at  all  values  of  the  height  z  to  show  that  (ju)  -  0  everywhere  within  the  volume. 

In  closing,  we  note  that  volume  fluctuations  can  in  principle  give  non-zero  contributions 
from  the  higher-order  terms  associated  with  range  dependence.  In  this  regard,  note 
Exercise  7  in  reference  [63].  In  equations  (26)  and  (29)  of  the  exercise,  there  is  a  higher- 
order  term  (in  that  context  related  primarily  to  the  transverse  smearing  term  rather  than 

the  downrange  smearing  term  considered  here)  quite  similar  to  the  term  X1  j 4kj  in 

equations  (3.11)  and  (C.35)  above.  As  shown  in  part  e  of  Exercise  7,  this  provides  a  non¬ 
zero  contribution  to  the  expectation  value  of  electron  states  that  are  excited  relative  to  the 
s-state.  These  states  are  zero  at  the  nucleus,  and  so  will  not  see  the  contact  potential 
there,  and  consequently  do  not  see  the  lowest  order  Lamb  shift. 
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F  Appendix:  Some  idealizations  inherent  in  the  “toy 
model”  of  the  hydrogen  atom 

In  this  appendix,  we  examine  various  aspects  of  the  sequence  of  approximations  that  take 
us  from  a  realistic  hydrogen  atom  to  our  “toy  model”  (given  by  equation  (3.37)).  Issues 
related  to  Quantum  Electrodynamics  take  us  too  far  afield  from  our  discussion  here,  and 
so  the  ramifications  of  replacing  2nd  quantization  with  fluctuations  in  the  underlying 
space  will  not  be  further  discussed  here.  Issues  related  to  the  other  approximations, 
however,  are  discussed  below. 


F.1  A  fundamental  distinction  between  the  Foldy-Wouthuysen 
transformation  for  the  Dirac  equation  and  that  for  the  Klein- 
Gordon/Helmholtz  equation 

Not  only  is  the  “toy  model”  crude  vis  a  vis  Quantum  Electrodynamics,  but  it  is  even 
crude  relative  to  a  more  realistic  description  of  the  hydrogen  atom  based  on  the  Dirac 
equation  instead  of  the  Klein-Gordon  equation.  The  distinction  of  most  significance  in 
the  context  of  our  study  of  the  Foldy-Wouthuysen  transformation  centers  on  the  fact  that 
this  transformation  generates  roughly  twice  as  many  terms  for  the  Dirac  equation  as  it 
does  for  the  Klein-Gordon/Helmholtz  equation.  These  extra  terms  introduce  new  physics 
into  the  Schrodinger  equation. 

To  see  how  this  occurs,  consider  the  O2  term  in  the  Foldy-Wouthuysen  expansion  of  the 
Klein-Gordon/Helmholtz  equation.  Here,  this  tenn  already  gives  the  0(V*) 
contribution  to  the  Schrodinger  equation,  while  the  same  term  in  the  context  of  the  Dirac 
equation  only  provides  an  0(v^,)  contribution  to  the  Schrodinger  equation.  Formally, 

we  still  have  the  same  number  of  terms  in  the  FW  transformation  as  before,  but  now  we 
are  approaching  the  Schrodinger  equation  more  slowly. 

This  leaves  room  for  a  greater  variety  of  terms.  For  example,  consider  the  term 
[0,(0,  £]]  .  For  the  Klein-Gordon/Helmholtz  equation,  this  term  participates  in 
recovering  the  expansion  of  the  square-root  operator  associated  in  some  general  sense 
with  the  relativistic  kinetic  energy  (e.g.,  yj  1  +  2  A/ k0  for  the  Helmholtz  equation).  In  the 

context  of  the  Dirac  equation,  it  becomes  an  “intermediate”  term  whose  sole  effect  is  to 
introduce  additional  physical  phenomena  into  the  problem.  This  particular  term  leads  to 
various  spin  related  effects  in  the  fine  structure  of  hydrogen,  and  also  to  the  Darwin  tenn 
for  the  hydrogen  atom  (see  page  5 1  or  reference  [57]).  The  new  physics  associated  with 
the  Darwin  term  is  Zitterbewegung.  We  will  see  below  (in  Appendix  F.2)  that  unlike  the 
Helmholtz  equation  for  a  classical  field,  the  Klein-Gordon  equation  for  the  pionic  atom  is 
modified  in  a  way  that  a  non-kinetic  energy  contribution  from  [(9,[(9,£]]  still  “sneaks 
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in”  in  the  form  of  a  Darwin  term  (of  course  there  are  no  spin  terms  for  this  spinless  field), 
but  the  appearance  of  this  Darwin  term  does  not  herald  a  doubling  of  terms  in  the  FW 
transformation  as  does  [(9,[(9,£]]  in  the  context  of  the  expansion  of  the  Dirac  equation. 


(Incidentally,  the  Darwin  term  is  interesting  because  in  some  contexts  it  resembles  the 
Lamb  shift  term  even  though  it  does  not  require  time  dependence,  and  so  it  is  a  product  of 
1st  quantization  rather  than  2nd  quantization.  For  further  discussion  concerning  the 
distinction  between  the  Lamb  shift  and  a  Darwin  tenn  and  between  2nd  quantization  and 
1st  quantization,  see  Appendix  F.3.) 


The  Foldy-Wouthuysen  transfonnation  for  the  Dirac  equation  also  contains  new 
explicitly  time-dependent  tenns  not  found  in  the  expansion  of  the  Klein- 
Gordon/Helmholtz  equation.  In  the  context  of  the  Dirac  equation,  the  time  dependent 


term 


0,6 


in  equation  (C.29)  goes  roughly  as 


(which 


in  turn  ends  up  in 


V  •  E  ),  while  the  same  term  goes  as  V2r  (dju/dx)  in  the  case  of  the  Helmholtz  equation. 
Thus,  for  the  FW  expansion  of  the  Dirac  equation,  we  would  not  expect  field-induced 
time-domain  (i.e.,  downrange)  virtual  fluctuations  to  show  up  until  the  j(9,[(9,£]J  tenn 
from  equation  (C.29). 


Thus,  we  see  that  replacing  the  Dirac  equation  for  a  spin  !4  field  with  a  Klein-Gordon 
equation  for  a  spinless  field  results  in  a  significant  change  in  the  structure  of  the  Foldy- 
Wouthuysen  transfonnation.  This  opens  up  room  for  a  great  deal  of  extra  physics.  Much 
of  the  fine  structure  of  the  hydrogen  atom  is  ultimately  derived  from  this  additional 
physics.  For  more  on  this,  see  Chapter  4  of  reference  [57]. 


F.2  The  impact  of  substituting  a  matrix  scalar  potential  in  state 
space  with  a  bona  fide  scalar  potential 

The  potential  in  pionic  atom  is  not  introduced  in  the  scalar  problem  as  was  the  case  in  our 
“toy  model”  (equation  (3.37)).  Instead,  it  is  introduced  as  a  diagonal  matrix  potential 
directly  into  the  state  space  equation  (see  pp.  202-203  of  reference  [57]). 

The  diagonal  matrix  potential  associated  with  the  pionic  atom  also  leads  to  a  Darwin  term 
that  is  absent  from  both  the  parabolic  equation  corresponding  to  the  Helmholtz  equation 
and  from  the  Schrodinger  equation  for  our  “toy  model”.  This  Darwin  term  is  emerges 
from  the  operator  [(9,[(9,£]]  ,  whose  primary  role  in  this  context  is  to  reproduce  the  3ld 

order  correction  to  the  expansion  of  the  operator  ^ \  +  7V2/m 1  (where  n1  is  the  magnitude 
squared  of  the  generalized  4-dimensional  momentum  vector  in  the  presence  of  an 
electromagnetic  4-potential  A  :  jz  -  pu  -  A  ).  In  this  sense,  this  is  an  incidental 

Darwin  term  associated  with  mixed  products  of  odd  and  even  operators,  while  the  Darwin 
tenn  for  the  Dirac  operator  illustrates  a  fundamental  change  in  the  symmetries  of  the 
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problem  (i.e.,  an  end  to  independent  time-  and  range-reversal  invariance),  and  as  such  it 
is  a  precursor  of  a  larger  class  of  extra  terms  (more  or  less  doubling  the  number  of  terms 

at  a  given  order  of  Vr  ~  V  ). 

Let  us  fold  our  further  exploration  of  the  Darwin  term  for  the  pionic  atom  into  an 
examination  of  the  distinction  between  the  downrange  Lamb  shift  and  the  Darwin  term. 
This  is  pursued  in  the  next  section. 


F.3  The  distinction  between  the  downrange  Lamb  shift  and  the 
Darwin  term 


The  Darwin  tenn,  vacuum  polarization  (i.e.,  the  downrange  Lamb  shift),  and  stochastic 
smearing  (i.e.,  the  traditional  or  transverse  Lamb  shift)  are  sometimes  confused  with  each 
other,  and  here  we  discuss  the  distinction.  The  key  difference  is  that  the  Darwin  term  is  a 
product  of  1st  quantization,  while  both  flavors  of  the  Lamb  shift  contribution  are  products 
of  2nd  quantization. 

The  Darwin  term  comes  out  of  the  term  [(9,[(9,£]]  produced  by  the  Foldy-Wouthuysen 
transfonnation.  For  a  Helmholtz  equation  with  a  scalar  potential,  the  potential  gets 
folded  into  the  operator  A  and  this  tenn  just  leads  to  the  standard  A3  tenn.  Note  that  in 
this  case,  the  scalar  potential  is  folded  into  both  the  odd  part  O  and  the  even  part  £  of 
the  state  space  equation  (i.e.,  the  matrix  representation  of  the  wave  equation). 


On  the  other  hand,  if  as  for  the  pionic  atom,  the  potential  is  a  diagonal  matrix  (i.e., 

(a  scalar  function)  •  1  =  V  (r )  ■  1 )  that  is  tacked  on  directly  to  the  state  space  equation, 

then  the  potential  is  a  part  of  the  even  operator  £  only.  In  fact,  O  is  a  scaled  back 
version  of  the  operator  A  times  f  ,  while  £  is  A  times  the  matrix  t)  plus  the  potential 
term  V  ■  1 .  £,  and  t)  are  the  same  two  odd  and  even  2x2  matrices  as  always,  while  the 
scaled  back  version  of  A  only  includes  a  generalized  kinetic  energy  term  of  the  form 
jf1/ 2m  (where  as  above  7tf  is  the  magnitude  squared  of  the  generalized  4-dimensional 
momentum  vector  in  the  presence  of  an  electromagnetic  4-potential  A  :  7tu  =  pu  -  Au ). 


Now  [(9,[(9,£]]  includes  not  only  the  A3  term  familiar  from  the  PE  expansion  of  the 
Helmholtz  equation,  but  also  a  tenn  proportional  to  [A,  [A,  F]]  «  n1 ,  \jr2 ,  F ]  .  This 


extra  term  is  a  bona  fide  Darwin  term  provided  that  we  use  the  following  as  our  working 
definition  of  the  Darwin  term  for  a  scalar  field  (as  opposed  to  a  field  with  spin  such  as 
the  electron  wave  function,  where  other  effects  also  emerge  from  the  operator 
[(9,  [(9,5]]  ):  the  Darwin  term  for  a  scalar  field  is  something  that  comes  out  of  the 

[(9,[(9,£]]  term  other  than  some  part  of  the  omnipresent  expansion  of  the  square  root 

operator  J 1  +  .  In  Section  6.1,  when  we  consider  density  jumps,  we  will  have 
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£=  A-  2  k0y  and  this  again  leads  to  something  that  we  might  technically  call  a  Darwin 
tenn:  a  term  proportional  to  [/l,[/l,-2^0y]]  . 

Note  that  the  Darwin  tenn  for  the  Dirac  equation  also  came  from  the  operator 
[(9,[(9,£]]  ,  but  in  this  case,  this  operator  is  in  no  way  connected  to  reproducing  the 
basic  Schrodinger  equation  (i.e.,  the  expansion  of  the  square  root  operator  connected  with 
relativistic  kinetic  energy).  The  term  [(^,[(9,^]]  instead  contains  a  rich  content 

associated  with  the  spin.  As  noted  above,  [(9,[(9,£]]  is  the  first  example  of  a  much 

larger  class  of  tenns  in  the  context  of  the  Dirac  equation  than  it  is  in  the  context  of  the 
pionic  atom.  (In  fact,  for  a  given  order  of  the  transverse  gradient,  the  number  of  terms  in 
the  FW  transformation  for  a  Dirac  equation  field  is  roughly  doubled  relative  to  the  order 
for  a  scalar  field.) 

To  contrast  the  Darwin  term  with  the  downrange  Lamb  shift  term,  note  that  the  Darwin 
tenn  is  not  proportional  to  a  derivative  with  respect  to  time  of  the  odd  part  of  the 
Hamiltonian  in  the  state  space  equation.  Thus,  the  Darwin  term  can  be  non-zero  even 
when  it  is  assumed  that  all  the  parameters  in  the  full  wave  equation  do  not  vary  with  time. 
Implicit  in  this  assumption  is  the  assumption  that  the  underlying  fabric  of  space-time 
does  not  vary  as  a  function  of  time.  The  Darwin  terms,  therefore,  are  a  product  of  1st 
quantization. 

On  the  other  hand,  the  term  that  produces  the  downrange  Lamb  shift  is  non-zero  only  if 
some  parameter  in  the  odd  part  of  the  state  space  equation  fluctuates  as  a  function  of  time 
(at  least  to  the  orders  considered  here;  eventually  at  very  high  order,  a  time  dependence  in 
the  even  part  could  add  terms  as  well).  To  induce  time  dependence  in  the  odd  component 
of  the  state  space  Hamiltonian  O  ,  we  need  to  introduce  time  dependence  into  the  existing 
time-independent  potentials  by  imposing  (by  hand)  fluctuations  of  the  underlying  fabric 
of  space-time  (or  equivalently  introducing  some  time-dependent  fluctuating  external 
field).  These  fluctuations  are  called  vacuum  fluctuations.  Since  our  theory  is  based  on 
ordinary  relativistic  quantum  mechanics  (and  its  reduction  down  to  its  non-relativistic 
limit),  the  vacuum  fluctuations  must  be  imposed  by  hand,  and  do  not  emerge 
automatically  from  1st  principles  as  they  would  in  quantum  field  theory.  Nevertheless,  the 
introduction  of  vacuum  fluctuations  by  whatever  means  is  called  2nd  quantization.  Thus, 
the  downrange  Lamb  shift  is  a  product  of  2nd  quantization.  In  the  context  of  quantum 
field  theory,  the  downrange  Lamb  shift  is  known  as  vacuum  polarization. 

Similarly,  the  well-known  transverse  (or  traditional)  Lamb  shift  also  occurs  when  we 
impose  fluctuations  by  hand  to  create  a  time-dependent  potential.  This  time,  contact 
potential  associated  with  the  transverse  Lamb  shift  comes  out  of  a  stochastic  averaging 
process,  but  it  does  not  take  the  form  of  a  specific  tenn  in  the  deterministic  Hamiltonian. 
In  this  way,  it  is  different  from  both  the  Darwin  tenn  and  the  downrange  Lamb  shift. 
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G  Appendix:  Generalizing  the  formalism  to 

accommodate  2-dimensional  interfaces  embedded  in 
3-dimensional  space 

This  appendix  provides  guidelines  for  constructing  the  stepping  algorithm  when  a  third 
dimension  given  by  the  unit  vector  y  is  present.  Full  examination  of  the  2-dimensional 
interface  embedded  in  3-dimensional  space  will  be  left  to  future  research.  This  issue  is 
not  particularly  urgent  from  a  practical  point  of  view,  as  most  current  work  in  the 
discipline  is  restricted  to  1 -dimensional  interfaces  embedded  in  2-dimensional  space. 
However,  the  treatment  here  is  provided  both  for  completeness,  and  even  more 
importantly,  because  an  understanding  of  this  material  provides  a  good  feel  for  some 
fundamental  issues  that  are  glossed  over  in  simpler  problem  that  is  examined  in  some 
detail  in  this  paper  (i.e.,  the  1-dimensional  interface  embedded  in  a  2-dimensional  space). 


G.1  The  stepping  procedure 

In  this  subsection,  we  examine  the  generalization  of  the  basic  stepping  procedure 
developed  for  the  simpler  2-dimensional  problem  in  Subsection  3.3.1.  Now,  we  should 
also  fix  the  step  size  Ay  just  as  we  previously  fixed  Az ,  and  begin  by  choosing  the  step 
size  Ax  so  that  the  2-dimensional  y  —  z  grid  slides  downrange  until  it  first  hits  the 
interface  somewhere.  The  situation  now  becomes  more  complicated  than  it  was  before. 
At  this  new  value  of  the  range,  it  is  very  likely  that  elsewhere  within  the  2-dimensional 
y  —  z  grid,  the  interface  will  fall  between  grid  points.  Under  such  circumstances,  it  is 
best  to  step  different  y  =  constant  lines  of  grid  points  different  downrange  distances  so 
that  they  always  hit  the  interface.  These  steps  should  be  performed  in  order  of  increasing 
downrange  tenninus. 

Note  that  some  supplementary  downrange  stepping  would  still  have  to  be  used  to 
evaluate  expressions  that  involve  many  points  on  the  transverse  grid  at  once.  Such 
expressions  are  relevant,  because  evaluation  of  H%  in  a  discretized  space  inherently 
involves  a  neighborhood  of  nearby  points  in  the  (transverse)  y  —  z  -grid.  In  particular, 

H x  contains  the  operator  (  n  is  some  integer),  which  in  turn  contains  the  operator 

dln x/dy1" .  To  calculate  such  a  quantity  at  a  given  point  on  the  grid,  we  need  to  make 
available  at  the  current  value  of  the  range  at  least  a  few  neighboring  grid  points  that  lie  on 
other  y  =  constant  lines.  (As  described  in  Subsection  3.3.2,  these  additional  points 
appear  in  finite  difference  expressions  for  the  y  -derivatives.)  The  needed  nearby 
y  =  constant  lines  of  grid  points  would  thus  be  temporarily  stepped  the  downrange 
distance  needed  to  bring  them  to  the  same  value  of  the  range  as  the  y  =  constant  line 
currently  under  consideration  (i.e.,  the  one  that  actually  intersects  the  interface  on  a  grid 


210 


point).  The  resultant  values  would  be  used  only  to  evaluate  the  non-local  function 
(H x  in  our  case)  along  the  y  =  constant  line  that  is  currently  under  consideration. 

For  points  along  this  line  that  are  near  to  but  not  directly  on  the  interface  (or  directly  on 
the  interface,  but  near  an  extremum),  this  procedure  will  typically  need  to  be 
supplemented  by  additional  interpolation  methods,  because  the  interface  might  still  pass 
between  a  nearby  pair  of  points  on  the  z  =  constant  line  that  emanates  from  the  given 
grid  point.  Note  that  this  is  an  issue,  because  the  procedure  for  calculating  derivatives 
using  a  discrete  grid  near  an  interface  requires  that  we  use  the  boundary  conditions  at  grid 
points  that  sit  directly  on  the  interface  (see  the  analysis  in  Subsection  3.3.2).  Thus,  in  an 
ideal  implementation  of  the  procedure,  we  would  temporarily  add  supplementary  grid 
points  as  needed,  so  that  the  interface  always  crosses  the  grid  directly  at  a  grid  point.  For 
a  relatively  crude  but  serviceable  alternative  to  this  kind  of  interpolation,  the  interface 
could  be  temporarily  deformed  to  coincide  with  the  transverse  grid  at  its  current  location. 
In  this  case,  the  true  values  of  interface  parameters  such  as  df  /By  (i.e.,  the  specified 
functions  of  x  and  y  )  would  still  be  inserted  into  the  boundary  conditions  that  are  used 
to  calculate  B2n%/dy2n .  (These  boundary  conditions  are  obtained  using  the  procedure 
outlined  in  Appendix  G.2  below.)  We  now  have  all  we  need  to  complete  the  procedure 
outlined  in  Subsection  3.3.2  for  calculating  B2n%/dy2"  and  consequently  H%  .  For  a 
typical  quasi-planar  surface,  the  interface  is  only  slightly  tilted  relative  to  the  horizontal, 
and  so  the  function  %  and  its  y  -derivatives  should  be  nearly  continuous  near  the 
interface,  and  the  issue  of  precisely  where  to  place  the  interface  while  calculating  the 
B"x/By"  has  at  best  modest  importance. 

The  generalization  of  the  stepping  procedure  discussed  here  in  Subsection  G.  1  thus  turns 
out  to  be  the  subtlest  issue  related  to  adapting  the  PE  formalism  developed  in  this  paper 
(for  a  1 -dimensional  interface  embedded  in  2-dimensional  space)  to  the  problem  of  a  2- 
dimensional  interface  embedded  in  3-dimensional  space. 


G.2  The  boundary  conditions 


This  subsection  provides  an  overview  of  the  key  issues  related  to  determining  the 
parabolic  equation  boundary  conditions  in  the  full  3-dimensional  problem.  The 
discussion  builds  on  the  treatment  of  the  2-dimensional  problem  provided  in  Subsection 
3.3.3.  At  the  interface,  we  should  make  temporary  use  of  a  local  right-handed  orthogonal 
coordinate  system  x  —  y'  —  z  such  that  z  is  the  direction  locally  nonnal  to  the  cut  of  the 
surface  in  the  2-dimensional  x  =  constant  plane  (and  of  course  z  is  in  that  plane  too). 

In  a  notation  that  is  independent  of  the  coordinate  system,  we  denote  this  2-dimensional 
nonnal  by  h2D .  If  z  =  /  (x,y)  in  our  original  coordinate  system,  then 


1 


l2  D 


1  +  (3//dvr 


fa/,  ^ 

v  +  z 

l  °y  J 


(0,-a//ay,l)/ \]l  +  {df/dy)2 
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The  other  axis  is  given  by  the  unit  vector 


y  = 


1 


l  +  (3//3y)2 


f„  df„) 

l  dy  J 


(0,1,3/ /dy)/ yjl  +  (df/dy)~  . 


Note  that  y  is  in  the  intersection  of  the  plane  locally  tangent  to  the  interface  and  the 
x  =  constant  plane,  and  so  we  will  give  it  the  alternate  identification  t2D  =  / . 

The  boundary  conditions  are  obtained  using  the  y  -  z  -coordinate  system.  Then  3/3/ 
and  3/3 z  are  expressed  in  terms  of  3 f  /3 y  ,  3/3 y  and  3/3 z  so  that  we  can  evaluate 
derivatives  using  the  discrete  grid  tied  to  our  fixed  coordinate  system.  To  be  specific, 

3/3/  =  t2D  ■  Yt  and  3/3 z'  =  n2D-VT  (G.l) 


with 


,3  „  3 


Note  that  higher-order  derivatives  of  /  with  respect  to  y  will  be  generated  as  3/3 y 
operates  on  t2D  or  h2D .  The  downrange  slope  /  of  the  interface  is  as  before. 

Now,  let  us  examine  some  of  the  specific  issues  related  to  obtaining  the  boundary 
conditions  in  our  local  /  -  z  -coordinate  system.  Continuity  of  X  will  guarantee  that 
derivatives  of  X  with  respect  to  /  (these  are  tangential  derivatives)  will  also  be 
continuous.  To  show  this,  use  the  follow  iterative  argument.  If  some  arbitrary  function, 
say  X  in  our  case,  is  continuous  at  the  interface,  then  the  difference  between  this 
function  evaluated  on  two  sides  of  this  interface  is  zero  (i.e.,  Xi  ~  Xu  =  0  )•  Since  the 
quantity  Xi  ~  Xu  remains  the  same  everywhere  on  the  interface,  its  tangential 
derivatives,  which  measure  the  rate  of  change  of  the  function  as  one  slides  along  the 
interface,  must  be  zero  as  well"""  .  This  in  turn  implies  that  the  tangential  derivative  of 
the  function  X  's  continuous.  Choosing  the  specific  tangential  direction  t2D  ,  we  have 

0 = UD  V  (x,  -  X a )  =  k D  ■  *x,  -  ho  ■  *  Xu  =  y  ■  Y  ,x,  -  y  •  Y  Tx„  =  %  -  /f  =  o . 

3v  ay 


Now,  iterate  and  apply  the  same  reasoning  to  the  function  t2D  ■  Vx  to  show: 


ww  This  is  one  of  several  spots  where  things  would  get  more  complicated  if  for  some  reason  X  were  not 
constant  on  the  interface.  In  this  case,  the  tangential  derivatives  would  track  the  X  -boundary  condition, 
whatever  it  is. 
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2D  ‘  ^2D  '  ^ Zl  4d  '  ^  2D 


■  =  y  ■  vr  v'  •  YtX,  ~  y  ■  Yry  ■  YtZu 


d2Z,  din 


(d/y  (dyj 


and  so  on.  There  is  a  tricky  point  here,  y  is  an  axis  on  a  coordinate  system  that  is  only 
used  at  one  point,  so  it  is  itself  a  function  of  position.  This  is  a  little  clearer  when  we  give 
it  the  label  t2D  =  y  .  The  issue  is  significant,  because  once  we  start  taking  higher-order 
derivatives,  V7  will  operate  on  t2D  =  y  .  In  other  words,  we  have  to  be  very  careful 
never  to  treat  y  and  similarly  z  as  ordinary  static  unit  vectors  attached  to  a  fixed 
coordinate  axes.  Provided  this  caveat  is  kept  in  mind,  it  is  generally  best  to  proceed  using 
compact  notation  that  treats  the  y  and  z  axes  like  any  other  axes,  obtain  the  local 
boundary  conditions,  and  only  then  use  the  identification  in  equation  (G.l)  above  to 
incorporate  the  fact  that  these  unit  vectors  may  in  fact  vary  with  position. 

Proceeding  in  this  manner,  note  that  H %  will  involve  terms  of  the  basic  form  3" zl^y" , 
3";jf/3z"  ,  and  d"2 '/dy"~mdz'" .  These  will  be  extracted  from  a  system  of  equations 
generated  by  the  boundary  conditions  on  d”zl{dy)"  ,  3"^/ (3z')"  and 
3  "xlwr  &')'■  a  "xlwr  are  continuous  at  the  interface.  Just  as  with  two 
dimensions,  terms  of  the  form  3"/f/ (3z')”  are  evaluated  using  repeated  integrations 
down  the  z  -axis  of  the  leading  order  derivative  of  X  with  respect  to  z  .  (Again,  we 
have  an  infinitesimal  integration  possibly  preceded  by  one  or  more  indefinite 
integrations.  Note  that  terms  that  only  involve  3”/f/(3v/)”  will  fall  out  during  the 

infinitesimal  integrations.)  As  we  use  repeated  integrations  to  evaluate  d"x/(dz)" ,  the 
only  tricky  part  concerns  the  integration  of  cross-terms  roughly  of  the  form 
3  "z/ (3/)“  (3 z')"  .  We  have  to  place  the  3/3 y  and  3/3zr  derivatives  in  the  order 

dictated  by  Hx  ■  As  long  as  “3/3z'”s  are  to  the  left  of  “3/3j/”s,  we  can  integrate  as 
before.  Problems  arise  when  we  come  across  the  need  to  integrate  expressions  of  the 
basic  fonn 

Jrfz'A(-).  (G.2) 


To  evaluate  something  of  the  form  (G.2),  we  will  need  to  reorder  the  partial 
differentiations  earlier  in  the  procedure  than  usual.  To  pick  up  subtle  cross-terms  related 
to  second  and  higher-order  y  -derivatives  of  /  when  we  do  so,  temporarily  fix  the 
y  -  z  -coordinate  system,  label  the  fixed  axes  by  y  and  z  ,  and  write  the  interface  as 
z  =  f  (x,  y') .  For  convenience,  choose  the  origin  to  be  the  location  on  the  interface 

currently  under  consideration.  The  higher-order  derivatives  of  f  reflect  the  curvature  of 
the  surface  (and  derivatives  of  the  curvature),  so  they  not  affected  by  translation  and 
rotation.  Thus,  3 qf  / (3 y')q  =  3 qf  / (3 'y)q  with  q  >  2  .  Now,  convert 
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8  ,  1  (  8  ,  8/  8  ' 

3-v'  Jl  +  (3//Svf  ^  3i’'32 ') 

8  .  1  f  8/  8  ,  8  v 

3z'  ji  +  (3//V)T  3f^'  3z'J 

Note  that  we  are  ultimately  only  interested  in  the  behavior  at  the  origin  where  the  slope 
df  /dy  is  zero,  so  the  distinction  between  ( y,z )  and  (y\ z  )  is  quite  subtle.  Since  the 
axes  are  now  fixed,  the  order  of  d/dy  and  d/diz  differentiations  no  longer  matters,  so 
send  to  the  right  all  the  d/dz  derivatives  (i.e.,  those  remaining  in  the  expression  denoted 
by  the  ellipsis  in  equation  (G.2);  assume  there  are  p  of  these).  Next,  write  II X  in  terms 
of  (y,  z  ) ,  and  take  a  sequence  of  indefinite  integrations  with  respect  to  z  to  get 

c)p  x/  {dz')P  in  terms  of  distributions  of  the  form  Sk  (z  —f  (x,yr))  (note  that  k  <  p- 1 ) 
and  perhaps  also  steps  of  the  form  ,  and  then  take  n  -  m  y  - 

derivativesxxx.  The  resultant  expression  can  be  integrated  with  respect  to  z  ;  we  just  have 
to  be  careful  to  convert: 

J  dz  — >  J  h2D  ■  ( dy  ,  dz') . 

All  this  effort  has  finally  given  us  the  contribution  from  a  handful  of  problematic  tenns  to 
the  integrations  that  generate  boundary  conditions  for  the  derivatives  of  the  basic  form 

d‘x/ (d^)1  •  (These  troublesome  terms  have  the  basic  form  (T x/ {^Z'T  ’’  (<V) ”  ”  {^Z'Y  •) 

From  now  on,  keep  track  of  these  specific  contributions.  Later,  when  we  use  equation 
(G.  1)  to  convert  to  the  y  —  z  coordinate  system,  treat  df  /dy  as  a  constant  for  these  terms 
only,  because  in  these  terms  we  have  already  effectively  taken  the  higher-order  y  - 
derivatives  of  / . 

Recall  that  we  currently  are  in  the  process  of  obtaining  the  full  set  of  equations  for  the 
boundary  conditions  on  dn x/dy" ,  dn x/dz"  ,  and  d" x/dz"'dy"~m  .  Furthermore  recall  that 


xxx  The  fact  that  I  df  /dy'\\  =  0  simplifies  the  result.  The  main  pitfall  here  involves  the  need  to 

V  /l(v',z')=0 

identify  new  step  functions  brought  in  by  fresh  derivatives  d/dy' .  These  we  can  deduce  from  boundary 
conditions  found  at  an  earlier  stage  of  the  process,  since  we  do  the  large  number  of  integrations  first  in 
order  to  bootstrap  ourselves  from  the  boundary  conditions  on  the  lower  orders  on  up  to  the  higher  orders 
(as  in  Appendix  Q.l). 


214 


these  come  from  the  boundary  conditions  on  terms  of  the  form  B"  x  /  {By')"  , 

()" xj ( dz')"  and  B"xl {By')"  (dz')m  ,  and  that  so  far  we  have  only  have  methods  for  the 
first  two  forms.  We  still  need  to  find  a  way  to  explicitly  evaluate  the  boundary  conditions 
on  cross-terms  of  the  form  B”  xj  {By')"  {Bz')'n  .  Note  that  the  B/By' s  and  B/Bz' s  have 
once  again  been  grouped  to  bring  the  latter  all  to  the  right.  Start  with  the  boundary 
condition  on  Bm x / (dz')"’  ,  and  this  time  write  it  as  a  difference  that  comes  out  to  zero. 
(Cfi,  Xi  ~  X n  —  0  above;  the  new  expression  equal  to  zero  will,  of  course,  also  involve 
various  quantities  evaluated  on  the  two  sides  of  the  interface.)  As  above,  all  transverse 
derivatives  B"~m/ (By')"  of  such  a  difference  will  also  be  zero.  This  gives  us  the  needed 

boundary  conditions  on  B"  xj ( By')"  (Bz')  "  (N.B.,  the  derivatives  must  be  taken  in  this 

specific  order.)  We  have  to  be  careful  to  keep  the  B/By'  and  B/Bz'  operators  in  their 
present  positions,  because  the  identifications  in  (G.l)  generate  cross-terms  of  the  form 
YAn,YTn2D,  anc'  these  in  turn  generate  subtle  modifications  to  the  boundary  conditions. 
(Keep  in  mind  that  the  now  order-sensitive  result  for 

rz/wr  (&r =&*■  r  («» -%)"x 

goes  into  a  system  of  equations  that  give  us  all  the  needed  boundary  conditions  on 
B"x/Byn ,  B" x/Bzn  ,  and  Bn x lByn~mBzm  ,  and  that  once  we  have  B" x /Byn~mBzm  ,  we  will 
again  be  free  to  switch  the  order  of  B/By  and  B/Bz  as  much  as  we  want.) 

This  is  all  we  need  to  evaluate  the  boundary  conditions  on  a  2-dimensional  interface 
embedded  in  3-dimensional  space.  Once  we  have  the  boundary  conditions,  then  we  use 
the  basic  procedure  outlined  in  Subsection  3.3.2  to  evaluate  transverse  derivatives  of  the 
wave  function  X  and  so  obtain  II X ,  which  is  then  used  to  perform  the  next  downrange 
step. 


H  Appendix:  Key  aspects  of  the  full  wave  boundary 
conditions  for  the  2nd  transverse  derivative  of  the 
field 

This  appendix  addresses  the  following  problem:  Given  that  A  solves  the  Helmholtz 
equation  ( V  ■  V  +  k2 )  A  =  0  in  2-dimensional  x  -  z  space,  and  that  the  sound  speed  (but 

not  the  density)  jumps  along  a  quasi-planar  interface  S  given  by  z  =  /(x) ,  find  the 
boundary  condition  in  terms  of  /  and  its  derivatives  to  2nd  order.  The  appendix  ends 
with  a  brief  discussion  of  the  results. 
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For  the  vector  n  normal  to  the  surface  z  =  /  (x) ,  we  have 


V  =  77  (77  ■  V)  +  (1  -7777)  ■  V  =  77  — +  (l  -7777)  ■  V 
v  '  dn 


(H.l) 


and 


V  ■  V  =  77  — 2- 
dn 


+ 


(1-7777)- V 

( 1  -  7777 )  ■  v  1  ■  r  ( 1 — 7777 )  ■  v 


a  „  a 


+  nd,indn+L 


(1-7777)- V 


dn 


(H.2) 


Now  on  S  ,  A  A  =  A,  -  An  =  0  and  also 

dAi  dAn_d(AA)_  Q 
dn  dn  dn 


Consider 


V  ■  V  =  77  — r  ■ 
dn 


(l  —  7777 )  ■  V  (AA)  +  n--n 
J  dn 


a  (A  A) 
dn 


+ 


(1  -  777?)  •  Vl  •  77  — — —  +  I” (1  -  777?)  •  V"|  ■  ["(1  -  777?)  •  Vl(A  A) 


dn 


(H.3) 


The  two  tenns  on  the  second  line  involve  tangential  derivatives  of  quantities  that  are 
always  0  on  the  interface,  so  they  fall  out,  leaving  us  with 


V-V(AA)  =  n—r-V(AA)- 
dn 


„  a  „a(A a)  „  a  „a(A a) 

■  7?  —  •  7? - - - h  7?  —  ■  7? - - - 

dn  dn  dn  dn 


=  7777 :  VV  (A  A ) 


for  A  A  =  0  (i.e.,  A  continuous)  on  a  surface  with  normal  n  . 
Now 


and  so 


7?  = 


z-f-x 

A+71 


777? 


:VV  =  r-N(f-/^-)(i-/.i):VV 

/!xc  +  (l-/=)ii-/(if  +  ii)]:VV  +  0(/5) 

O  (/’). 


J  dx2  '  1  '  dz 2  J  dxdz 


(H.4) 
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Now,  the  tangent  vector  is  given  by  t  =  x  +  f  ■  z  +  O  (/2 )  and  the  tangential  derivative  is 

—  +f  — 
dx  '  dz 


‘'■f-bsb+ols2) 


or 


»  V-/I-+ o(/!)=  3 


dz 


dx  ’ 


and  substituting  into  d2/dxdz  in  equation  (H.4),  we  have 
V-V(AA)  =  M:VV(AA) 

=r-s-^+(  i-r-f(AA) 


dx~ 


dz~ 


(H.5) 


-2fi-WddAA+1fA  <A/‘)+o(/i) 


dz 


dz2 


The  second  to  the  last  term  in  equation  (H.5)  involves  a  tangential  derivative  of  a  quantity 
that  is  zero  on  the  interface,  so  it  goes  away.  Also 


so  that 


d2(AA)  _r 

dx2 


t-V 


t-V 


(A  A)  +  0(/) 


which  is  just  the  repeated  application  of  tangential  derivatives  on  something  that  is  zero 
on  the  interface,  and  so  d2  (A  A) /dx2  =  0 .  This  leaves  us  with 


V.V(AJ)  =  (l+/!)tM+0(/>). 


(H.6) 


Also 

0  =  V  ■  VAj  +  kjAj 
0  =  V  ■  V Ajj  +  kjjAn 

so  evaluate  each  of  these  just  on  either  side  of  the  interface  and  subtract  (recalling 
Aj  =  An  =  A  on  S  ): 

0  =  V-V(A  A)  +  (k2-k2)A.  (H.7) 

Now  let  the  sound  speed  in  medium  /  be  the  reference  value  c0 . 

Thus,  with  n  =  c0/c ,  we  get  nn  =  c7/c7/ ,  and  //7/  =  (l - n), )  jl .  This  implies: 
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=>  k]  -  kl  =  kl  ( 1  -  n2 )  =  2 k0fin  . 


k,=K  1 

kn  =  nk0  j 

Substituting  into  equation  (H.7),  we  have 

0  =  V  ■  V  (A  H)  +  2k\nnA . 

From  equation  (FI. 6),  this  leaves  us  with  the  boundary  condition 

°  =  (1  +  /2) +  2koMnA  +  0(/3) ,  (H.8) 

or  more  usefully 

^pL-^f  =  -2(l-/1)kM,A+0(/3).  (H.9) 

This  suggests  that  the  C)(  /'2  j  terms  in  the  parabolic  equation  boundary  conditions 
contributed  by  the  Foldy-Wouthuysen  transfonnation  tenn  ji/Sk0  can  be  understood  as 
contributing  to  the  boundary  condition  on  d2A/dz 2  . 

The  O  (7)  tenn  is  a  little  different.  We  can  get  the  implicit  boundary  condition  on 

d3A/dz 3  by  taking  Vr  of  equation  (H.3).  Now  the  second  to  the  last  term  of  equation 

(H.3)  is  nonzero  and  can  produce  a  tenn  proportional  to  f ,  and  so  we  would  expect  the 

0(/)  contribution  to  be  associated  with  the  implicit  boundary  condition  on  d\4/dz' . 

The  3rd  order  transverse  derivative  is  a  pretty  obscure  quantity  in  the  context  of  the 
Helmholtz  equation.  It  shows  up  neither  in  the  equation  of  motion  nor  in  a  finite 

difference  discretization  of  the  Helmholtz  equation.  Thus,  the  O  (/ )  contribution  to  the 

boundary  conditions  for  this  quantity  involves  a  subtle  effect  that  is  implicit  in  full  wave 
theory,  but  must  be  made  explicit  in  the  PE.  To  get  a  crude  idea  of  the  physics  associated 

with  the  0(/)  term,  consider  the  boundary  condition 

az  4 

in  isolation.  For  junf  >  0  ,  this  allows  a  solution  that  decays  exponentially  away  from  the 

surface  as  exp(-(//77/)z/4) .  This  topic  is  considered  briefly  in  Section  4.2,  and  again 

in  more  detail  later  in  Section  5.3.3,  and  in  Appendices  N.2  and  O.  Here,  note  that  these 
boundary  wave  solutions  look  like  polaritons  (see,  for  example  the  work  of  Soto-Crespo 
et  a/.83,  or  Tang  and  Frisk  directly  in  the  context  of  acoustics84). 
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In  Section  3.2,  attention  is  concentrated  on  the  0(/2]  component  to  the  boundary 

conditions,  and  in  particular  to  the  part  related  to  the  boundary  condition  on  V2^  and  by 
extension  on  V2TA .  To  do  so,  Section  4.2  considers  the  tilted  interface  with  no  curvature, 
and  examines  this  issue  in  some  depth. 


I  Appendix:  Constructing  an  ansatz  and  obtaining  the 
state  space  equation  when  the  density  is  not 
uniformly  constant 


The  goal  of  this  appendix  is  the  construction  of  a  state  space  equation  that  is  a  suitable 
starting  point  for  the  Foldy-Wouthuysen  transformation.  To  construct  such  an  equation, 
we  need  to  fully  understand  the  philosophy  behind  this  transformation. 


The  state  space  equation  will  be  the  equation  of  motion  for  a  vector  of  the  basic  fonn 


<b  = 


X) 


(I-l) 


(e.g.,  equations  (3.2)  and  (5.2)).  For  the  acoustic  field,  6  and  J  are  scalars,  but  for 
vector  fields  they  will  be  vectors.  <£>  is  a  2-dimensional  vector  in  what  may  be  called  a 
state  space,  where  6  and  X  are  two  possible  states  (of  the  field).  The  state  space 
equation  has  the  basic  fonn 

3<f> 

-  =  H<P  (1.2) 

ox 


where  TL  is  a  2  x2  matrix  in  state  space.  Each  of  the  four  elements  of  'H  can  be 
operators  (for  vector  fields,  they  can  even  become  matrices  themselves).  For  a  classical 
field,  the  Foldy-Wouthuysen  transformation  assumes  that  the  state  space  equation  has 
been  constructed  in  such  a  way  that  is  pseudo-Hennitianyyy.  As  discussed  in  reference 
[1]  (or  equivalently  appendix  C.1.2),  an  operator  a  is  pseudo-Hermitian  when 

( r/a )  =rja with  tj  as  always  defined  in  (3.3).  (Note  that  tj  and  <^are  both  2x2 
matrices  in  state  space,  rj  is  a  kind  of  metric  on  our  state  space,  and  so  magnitudes 


yyy  More  generally,  this  assertion  holds  for  fields  that  have  integer  intrinsic  spin  (bosons).  For  fermions 
(half-integer  spin),  the  full  matrix  Hamiltonian  is  Hermitian.  However,  for  a  fermion  field,  the  equation  is 
in  the  form  of  a  state  space  equation  from  the  beginning,  and  the  derivation  in  this  appendix  is  not  needed. 
Furthermore,  this  distinction  has  very  minimal  effect  on  the  subsequent  portions  of  the  calculation. 
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squared  of  vectors  in  this  space  <J>  are  inner  products  of  the  vectors  with  themselves 
under  this  metric:  .)  As  discussed  in  reference  [1],  (or  equivalently  appendix 

C.1.2)  pseudo-Hermiticity  stabilizes  the  equation  by  guaranteeing  that  the  magnitude 
squared  of  the  vector  <J>  77$  is  conserved  under  downrange  propagation.  In  fact,  this 
conservation  law  and  the  pseudo-Hermiticity  of  the  (matrix)  Hamiltonian  are  simply  two 
manifestations  of  the  same  property. 


Under  the  Foldy-Wouthuysen  procedure,  one  then  performs  a  series  of  canonical 
transformations  <f>old  — >  <f>  ncw  .  By  canonical,  it  is  meant  that  the  transformation  is 
constructed  in  such  a  way  that  the  basic  form  of  the  equation  of  motion  is  preserved-in 
this  case: 


,3d> 


=  K . d> 


new  * 


Furthermore,  7fnew  is  still  pseudo-Hermitian.  This  latter  attribute  is  obtained  by  requiring 
that  the  transformations  be  pseudo-unitary  (i.e.,  if  <£>old  =  t/d>new,  then  U'rjU  =  ij  ;  again 
this  point  is  discussed  in  reference  [1]).  Note  that  then  cho|d  //cho|d  =  c£> new ,  and  so 
the  magnitude  of  <f>new  must  be  preserved  during  downrange  propagation  as  well,  and 
7fnew  must  indeed  be  pseudo-Hermitian. 


Thus,  the  first  step  toward  applying  the  Foldy-Wouthuysen  transformation  is  the 
construction  of  an  ansatz  (i.e.,  a  definition  for  9  and  X )  that  leads  to  a  proper  state  space 
equation.  To  guarantee  that  the  Hamiltonian  of  the  state  space  equation  H  be  pseudo- 
Hermitian,  we  choose  9  and  x  such  that  there  are  physical  constraints  that  force  <T>  //ch 
to  be  conserved  during  downrange  propagation.  Specifically,  if  we  construct  9  and  X 

from  the  original  field  in  such  a  way  that  =  |#|~  -  \x\  is  proportional  to  the 


downrange  flux  Sx ,  then  energy  conservation  would  force  the  required  conservation  of 
<f>  //<$> .  Specifically  for  an  acoustic  field  with  a  variable  density,  this  means  that  a  factor 
of  1/ p  (as  always  p  is  the  density  as  a  function  of  position)  will  somehow  need  to  be 
built  into  9  and  X  •  This  leads  to  the  ansatz  given  in  equation  (5.2)  and  repeated  here: 


9 


1 

2 

1 

Z  =  2 


^  A  +  —  —  A 

p  K  ) 


(1.3) 


A-  —  — A 


p  k{ 


0  J 


(the  dot  stands  for  a  downrange  derivative  A  =  dA/dx  ).  Note  the  general  form  of  this 
result.  If  the  downrange  flux  is  the  product  between  the  field  and  another  quantity,  then 
the  ansatz  for  9  and  X  becomes  a  sum  and  difference  between  this  field  and  that  other 
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quantity  times  i .  (When  vector  fields  are  considered  in  Section  7  and  Appendices  R  and 
S,  6  and  X  themselves  become  vector  fields,  but  the  above  reasoning  remains  valid.) 


Let  us  verify  that  this  ansatz  indeed  works: 


ee  =  - 


AA  + 


Po  AA 


+  -LP±Ua-AA*) 

p 2  K  K  p  v  . 


AA  + 


Po  AA*  2  p0 
P2  K  k0  p 


Im  (AA*) 


Substituting  -  A  for  A  gives 


XX  =4 


A  A  + 


Po  AA *  ,  2  p 


A 

p1  k]  k0  p 


+ — - 


Im  (AA*) 


Thus 


ee--xx=-^PAP  =  -2  Ac0 

K  P 


Im  (AA*) 
2pco 


=  ~2P(P()Smc  ■  X  ■ 


(1.4) 


where  5ave  =  Im  (dVdj  jlpco  is  the  time-averaged  flux  (see  reference  [98],  equations 

64.6  and  65.4  and  use  a  well-known  result  for  the  time-averaged  product  of  the  real  parts 
of  two  complex  fields;  also  compare  equation  20b  in  [1]). 


Next,  we  need  to  manipulate  the  definitions  for  6  and  X  and  the  equation  of  motion  for 
the  pressure  field  A  to  construct  the  state  space  equation.  We  will  need  to  make  use  of 
one  more  trick  to  proceed.  To  preserve  Hermiticity  down  the  line,  we  should  follow 
reference  [98]  (pp.245,  288-289)  and  group  the  overall  factor  of  the  density  with  k 2  as 
follows: 
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(1.5) 


Note  that 
k^=kl 
P  Po 


r  "2p0^ 

k2 

—  K°  | 
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[  p  j 

Po 
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Kpdxj 
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KP  J 

(1.6) 


Recall  that  when  the  density  was  everywhere  the  same,  it  was  convenient  to  characterize 
the  index  of  refraction  squared  by  a  quantity  p  =  (\-n2^j2  (see  equation  (2.5)).  The 
left  result  of  equation  (1.6)  suggests  that  we  now  have  to  generalize  this  to 


P  = 


p 


(1.7) 
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Noting  that  n  =  c0/c  =  ^j( pK ) j ( p()K() )  (recall  that  K  is  a  compressibility  and 
c  =  ^\/(pK)  ),  we  see  that  p  is  in  fact  a  measure  of  the  change  of  compressibility 


P  = 


K 


K, 


(1-8) 


o  J 


rather  than  simply  of  the  square  of  the  index  of  refraction.  The  two  turn  out  to  be 
equivalent  only  in  the  special  case  when  the  density  is  everywhere  constant.  Now  let  us 
define  an  operator 


X=  tWp)  —koJu. 

2  K 


(1.9) 


(In  this  context,  this  operator  turns  out  to  be  an  intermediate  quantity,  but  it  will 
occasionally  reappear  in  our  subsequent  analyses-for  example  in  Appendix  L.2.) 
Multiplying  equation  (1.5)  by  p0  and  making  substitutions  (1.6)  and  (1.9),  we  end  up  with 
the  following  fonn  of  the  equation  of  motion: 


3  f  -  ^ 


dx 


—  A 
P  ) 


+  2k0AA  +  kQ  A  —  0  . 


(1. 10) 


Recalling  the  ansatz  (1.3),  we  have 
A  =  d  +  % 

e-x ) 

p  i 


dx 


f  —  A 

l/7  J 


=—(v-z)’ 


leaving  us  with  the  following  alternate  form  of  the  wave  equation: 

-ik()  (P~  z)  +  2k()A  (6  +  x)  +  kl  (0  +  z)  =  0  ■ 


(1-11) 


Next,  we  obtain  a  second  equation  for  0  and  %  by  manipulating  their  definitions  in 

terms  of  A  and  A  (i.e.,  the  ansatz).  Adding  and  subtracting  the  two  equations  in  (1.3), 
we  have 


A  =  0  +  z 

ik0  —  A  =  0  -  X 
P 


A  =  6  +  Z 

A  =  -ih—(0-z)  ' 
Po 


(1.12) 


Removing  A  from  the  right  hand  pair  of  equations  in  (1.12),  rewriting  equation  (1.11), 
and  multiplying  through  by  ik0  leaves  us  with  the  following  pair  of  equations 

K  (p  +  z)  =  K —{0-z) 

Po  .  (1.13) 

ih(e~z)  =  2k0A{e+z)+kl{e+z) 
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Adding  and  subtracting  these  two  equations,  and  dividing  by  2 k0  gives  us 

w  =  he+x)  +  l^-(6+x)  +  l^-P-(6-x) 

2  2  p0 

iX  =  -H0  +  x)-^-{0  +  x)  +  ^r  —  {O-X) 

2  2  p0 


Recall  Y=y2{^-p/p0)  so  that 


kjL£-=h_k 
2  P„  2 


and  consequently 
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Applying  the  definition  A  =  A  +  k0y ,  this  gives  us 


f  A-2k0y+k0  A  ' 

U  j 

v  -a 

kX) 

and  finally  using  (1.1),  we  have 

.“/o  o 

i4>  =  \  A, 

-1  OJ 


+  (A-2k0y) 


1  0 
0  -1 


+  kn 


0  N 
~b 


which  is  just  the  state  space  equation  (5.3). 


(1.14) 


J  Appendix:  The  4th  order  Hamiltonian  n  for  a  variable 
density 

Appendix  J.l  provides  a  derivation  of  the  Hamiltonian  7i  good  for  an  acoustic  field, 
where  the  density  and  sound  speed  are  both  allowed  to  vary.  The  expression  is  quite 
general,  and  it  applies  even  if  the  density  variation  involves  a  discontinuity.  Appendix 
J.2  derives  a  simplified  Hamiltonian  for  the  special  case  where  the  density  jumps  at  an 
interface,  but  is  otherwise  constant. 
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J.1  The  general  4th  order  Hamiltonian  H  for  a  variable  density 


This  appendix  provides  the  details  of  the  derivation  of  the  matrix  Hamiltonian  'H  for  a 
variable  density  as  it  is  given  by  equation  (5.7). 

The  starting  point  deriving  our  general  H  is  the  state  space  equation  given  by  equations 
(5.3)  and  (5.4)  (or  equivalently  equation  (1.14)  above),  which  are  reproduced  below: 

i  =  77<f>  with  H  =  O  +  £  +  kjj , 
ox 


where 

0  =  y U; 

£  =  (A-2  kj)ij' 

with  the  matrices  //  and  %  as  always,  and 
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(J.2) 


oy 


2  A, 


These  values  for  O  and  £  will  be  substituted  into  the  general  4th  order  result  given  by 
equations  (3.9)  and  (3.10)  (and  equivalently  (C.28)  and  (C.29)),  and  reproduced  below 
for  convenience: 

HIV  =k07)  +  Sn ,  (J.3) 

where 
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f  o2  O4^ 


gii  _  v2^o  8*0  j 
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+  £■ 


8  k; 


8  KL 


0,0 


(J.4) 


+-^(-[0,£]2  -i{d,[0,£]}+02)  +  5th  order 


8  K 

(as  always,  the  dot  denotes  the  downrange  derivative  d/dx).  Beginning  with  the 
substitution  £  =  (A  -  2k0y)r/ ,  this  yields 
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£"  ='/|r-'/^T+'t'/- 2rk.n- 

+^[o,[o,2nrtl-^[o,o]  (J5) 

+^r(-([0,l7]-[0,2rta,])2-/{0,[0,^]}+i{0,[0,2rta7]}+C>J) 

o/r0  v 

+  5th  order. 

Now  expand 

([0,^]-[0,2^])2  = 

[O.Xrj]2  +[0,27^]' -[0^v][0ayk^]-[0,2rtM[0,H 

and  recombine  terms  to  get 

=k0r/  +  £n  =(l)  +  (2)  +  (3),  (J-6) 


Note  that  (1)  identically  reproduces  the  terms  that  were  present  in  the  constant  density 


formula,  and  note  that  according  to  equation  (J.l),  O  is  formally  the  same  as  it  was  in  the 
constant  density  case  (only  the  definition  of  A  changes  a  little,  and  that  is  immaterial 
until  we  explicitly  substitute  for  A  ),  and  so  these  terms  will  reproduce  the  constant 
density  result  given  by  (C.3 1): 


(1  )  =  J]k0  1  +  —  -^  +  ^-^-^  +-^\a,a\--^\a2,a\. 

v  ’  '  0  ^  t2  ot3  st4  st2  L  J  /lfr3  L  J 


Next,  let  us  explore  (2) ,  the  new  terms  unique  to  the  variable  density  problem  that  do  not 
contain  explicit  range  derivatives.  To  do  so,  we  exploit  the  following  identities  (obtained 
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using  the  result  tj%  =  -^rj ,  which  follows  directly  from  equations  (3.3)  and  (3.6),  the 
definitions  of  tj  and  C, ): 


f  \ 


[0,At]\  =  A2[^,t]\  =  A2 
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2  A2£tj 


f 


[0,2yk()7l\  =  2k(t\A£,?iy\  =  2k() 


Ay^rj-yArj^ 

V 


(J.9) 


=  2  k0  ( Ay+  yA)  =  2k0  {A,  y }£ij 


to  obtain  (again  using  equation  (3.6),  the  definition  of  E, ,  to  get  =  -1 ) 

[O,  Arj\  [0, 2  yk0r]\  =  4A2k0{A,y}^T]  =  -4A2k0  {A,  y}££ 

l2jJLTJ . 

=  4  A\{X,y) 


Similarly,  we  have 

[O,2yk0n\ [O, At]}  =  4 k0 {A,y}A2-^T]  =  -4 k0 {A,^}A2 

2^  "iT  , 

=  4k0{A,y}A2 


and  so 


[o,i7][o,2rM+[o.2ri(I?][o,^]=4t0{i2,{i,r)}. 


(J.  10) 


Also,  from  the  second  equation  in  (J.9),  we  have 

[o,  2yk0?]]2  =  4k2  {A,  y}2Zv£v  =  ~4k2  {A,  y}2  f  rf 

=4*2{i,)f 


(J.ll) 


To  finish  evaluating  (2) ,  we  also  need 
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^-[o,  [o,  2  rktfW = ^t[^.  2*.  R  r}^] 
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and  pulling  out  a  factor  of  &0/7 ,  we  have 

(2)  =  A:07f-2r--^RRr}}--fTRr}2+^R,Rr}} 
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Finally,  for  (3)  we  have 
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Combining  the  three  results,  we  have  from  equation  (J.6) 


(J.  12) 


(J.  13) 


(J.  14) 


227 


nIV  =  (  i)+(2)+(3) 


vK 


A  A2  A 3  5/14  /l2 


1  -2/  +  —  — -r-+  , 

kn  hi  2kl 


0  "-0  x'n'0  8^0  8£0 


-fy  fiR  /}}  -  TkR  rf + 7kR>R  r}} 


+  - 


m 


4k. 


A,  A 


2k, 


2ki 


4*03L 


/l2, /l 


4k2 


RRr}] 


+  5th  order 


(J.  15) 


which  is  equation  (5.7)  in  slightly  rearranged  form.  In  equation  (5.7),  the  tilde  ~  and 
superscript  IV  have  been  dropped  from  7 1 . 


J.2  The  4th  order  Hamiltonian  n  good  at  a  density  jump 

In  this  section,  we  take  the  general  4th  order  Hamiltonian  for  an  acoustic  field  in  an 
environment  where  the  density  varies  (equation  (5.7)  and  equivalently  (J.  15))  and  adapt  it 
to  the  special  case  where  the  density  p  (and  so  y  =  /{l  -  p/p0) )  jumps  at  an  interface, 

but  is  otherwise  constant.  The  result  is  equation  (J.24),  which  is  equivalent  to  equation 
(5.9).  While  this  discussion  invokes  single  a  quasi-planar  interface,  there  is  nothing  to 
preclude  its  application  to  multiple  interfaces,  or  ones  that  are  not  quasi-planar. 

To  simplify  the  discussion,  it  will  be  assumed  that  in  the  half-space  the  compressibility 
K  (or  equivalently  p  =  /(l-  K/ K0))  does  not  depend  on  the  range  in  the  half-spaces 

(although  it  may,  of  course,  jump  along  the  range-dependent  interface).  A  modest 
correction  to  the  formalism  accounting  for  range  dependence  of  p  in  the  half-spaces  will 
be  noted  and  tracked  through  the  calculation.  The  resulting  correction  term  is  given  in 
equation  (J.26),  which  is  the  same  as  equation  (5.10). 

This  effort  generalizes  the  calculation  that  led  to  the  manifestly  range-reciprocal  form  of 
the  parabolic  equation  for  the  case  where  the  density  was  held  constant  (see  Appendix 
C.2.5)  by  applying  the  same  basic  strategy  to  the  case  where  the  density  varies  (i.e.,  now 
use  equation  (5.7)/(J.  15)  as  the  starting  point).  The  extra  terms  associated  with  the 
parameter  y  (which  reflects  density  variation  away  from  a  reference  value)  would 
significantly  complicate  the  result.  However,  if  we  exploit  the  assumption  that  the 
density  is  constant  within  a  given  medium  and  only  changes  along  interfaces,  then  the 
result  simplifies  to  a  rather  straightforward  generalization  of  result  for  the  constant 
density  problem  (i.e.,  the  result  expressed  in  equation  (C.35)  or  equivalently  in  equation 
(3.1 1)).  Thus,  we  will  apply  extra  Foldy-Wouthuysen  transformations  patterned  after 
those  in  Appendix  C.2.5,  use  the  8  -function  procedure  outlined  in  Section  5.2  and 
Appendix  K  to  evaluate  the  resultant  expressions,  and  exploit  the  assumption  that  the 
density  is  constant  away  from  the  interface  to  simplify  the  result. 
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The  extra  Foldy-Wouthuysen  transformations  that  generate  the  manifestly  range- 
reciprocal  form  involve  the  terms  in  equation  (5.7)/(J.15)  that  are  induced  by  range 
dependence.  (Id  est,  those  terms  containing  a  downrange  derivative  signified  with  a  dot. 
These  also  happen  to  be  the  terms  associated  with  “vacuum  polarization”  effects.)  These 
four  tenns  are 


Since  only  these  terms  will  be  involved  in  the  operations  considered  here,  it  is  convenient 
to  isolate  these  tenns  from  the  rest  of  the  Hamiltonian  TL1'  by  rewriting  equation  (5.7) 
/(J.15)  as 

H:r  5th  order  (J.16) 

where 

,  „  A  A2  A3  5A4 

1 —  2y  -\ - —  h - — - — 

k0  k-  2  kl  8k^ 

i’AVq  ^<#Vq  ^/Cq 

(The  subscript  RI  stands  for  range-independent.  Note  that  for  the  moment  we  follow 
equation  (J.15)  and  retain  the  tilde  and  superscript  IV  on  H  .)  Equations  (J.16)  and 
(J.17)  reduce  to  the  constant  density  result  (C.3 1)  if  we  set  y=0. 

Let  us  begin  with  the  term  proportional  to  A2 .  Note  that 

A2  =  jL(AA)-AA.  (J.18) 

ox v  ’ 

Since  A  and  A  are  proportional  to  bifurcated  S' " 1  -functions,  A  is  evaluated  in  the  half¬ 
spaces.  Recalling  the  definition  of  A ,  equation  (5.4),  we  note  that  the  range  derivative  of 
Vr  is  zero  and  given  our  assumptions  so  are  the  range  derivatives  of  //  and  y  (evaluated 
in  the  half-spaces),  and  consequently  in  the  half-spaces  A  is  also  independent  of  the 
range.  Thus  (J.18)  becomes 

A2  =  A—-AA  =  AA-AA  =  0 

dx 

and  this  term  drops  out. 
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Now,  note  what  happens  when  we  drop  our  assumption  that  /u  is  range-independent.  At 


the  surface,  X  of  course  still  involves  a  linear  combination  of  bifurcating  8  and  S' - 
functions,  and  so  when  we  multiply  it  by  another  X ,  we  just  pick  up  that  factor  evaluated 
an  infinitesimal  distance  in  the  half-spaces.  In  this  case,  the  associative  property  to  be 
discussed  in  Appendix  K.3  is  violated:  in  other  words  it  matters  which  X  we  choose  to  be 


the  first  one  (and  consequently  bifurcate),  so  we  symmetrize  the  choice  of  which  X  to 
bifurcate.  This  gives  us 


Idx  J 


’X s 


The  subscript  S  signifies  that  Xs  is  to  be  evaluated  right  on  the  surface  S  .  The 


superscript  ±  serves  to  remind  us  that  as  Xs  bifurcates,  its  coefficients  are  evaluated  in 
the  two  half-spaces.  Note  that  since  we  now  are  allowing  fu  and  consequently  X  to 


depend  on  the  range  away  from  the  interface,  the  tenn  proportional  to  X2  will  also  have 
to  be  added  to  the  Hamiltonian  TL  away  from  the  interface.  Indicating  that  the  operator 

X2  in  its  usually  understood  simple  form  will  be  evaluated  everywhere  except  right  on 
top  of  the  interface,  we  will  call  it  a  principal  value  denoted  by  PV.  This  gives  us  result 

1 


X2^-< 


\(ds lV  ,  1 

[l&J  ’  ^ 

•  +  PV 

X2 

(J.  19) 


For  the  moment,  we  will  put  result  (J.  19)  aside,  and  proceed  assuming  X  =  0.  It  will  turn 
out  that  there  is  the  only  one  other  place  where  we  use  the  assumption  that  )i  is  range- 
independent:  in  equation  (J.23).  Therefore,  we  will  return  to  result  (J.  19)  at  the  end  of  the 
calculation,  and  examine  how  it  would  modify  the  final  result  for  Ti11  2  . 

Now,  let  us  apply  the  extra  Foldy-Wouthuysen  transformations  used  in  Appendix  C.2.5 
to  create  a  manifestly  range-reciprocal  Hamiltonian.  Once  again,  we  have  (as  in  equation 
(C.32)) 

gir.i  _  yX. 


and  as  in  equation  (C.33) 


nIVA=nIV+i 


sIV\y 


kn  +  X  —2  y - 

2k0 

new! 


S,v  \  [ SIVA ,  T]k0  ]]  -  S/F1  +  5th  order  (J.20) 


This  term  is  always  zero 
because  i)  commutes  with  itself 


Note  that  the  tenn  -  Y 


is  always  zero  because  it  reduces  to 
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1  1 
2  64 Aj 


Once  again,  S  -function  bifurcation  causes  X  to  “lift  off’  the  interface,  so  y  acts  like  a 
constant,  and  this  commutator  is  zero.  The  new  term  disappears.  Thus,  we  fully 
reproduce  equation  (C.33)  with  the  range-independent  “square-root  operator”  suitably 
generalized  as  in  equation  (J.17),  the  X2  -tenn  gone  (for  now,  but  we  note  that  it  is 
passing  through  as  before  and  would  now  equal  -rjk0  (i2/8Ay  j  if  we  kept  it),  and  we  also 

are  carrying  along  a  new  term  proportional  to  -  X,{X,  ^}J.  The  explicit  answer  (with 
Hm  given  by  equation  (J.17))  is 

5'1 

Now,  let  us  again  continue  as  in  Appendix  C.2.5,  and  choose 

5'" 2  =-^-{1,1}  (J.22) 

1  O/Cq 

as  in  equation  (C.34).  Following  equation  (C.35),  we  have 
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£7^2 
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contribution  °<  X2 
-1  which  falls  out 
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+  5th  order 
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+ 1 


+  5th  order 


This  extra  term  is  zero 


(J  .23) 


Once  again,  note  that  bifurcation  of  X  lifts  everything  off  the  interface,  and  y  operates 
like  a  constant  in  the  half-space  and  so  commutes  with  .  Thus  the  new  term  falls 

out.  Also  note  that  the  term  —SIV  2  includes  a  portion  that  is  proportional  to  X2 .  Given 
our  current  working  assumption  that  in  the  half-spaces  fd  is  range-independent  and  y  is 
constant,  this  contribution  is  zero.  Otherwise,  this  term  will  provide  exactly  the  same 
contribution  proportional  to  X2  that  it  had  previously  in  Appendix  C. 2. 5.  Recall  that 
rather  than  explicitly  tracking  the  X2  -terms,  we  are  simply  verifying  that  they  are  going 
through  unchanged  as  we  follow  the  procedure  pursued  in  Appendix  C.2.5.  At  the  end, 
we  will  simply  write  down  the  final  result  of  that  previous  calculation,  and  only  then 
make  the  substitution  (J.19).  Based  on  comparison  with  equation  (3.1 1)/(C.35),  the 
current  running  value  of  the  term  proportional  to  X2  is  rjk0  [X2/4 k^  .  This  will  not 
change  anymore. 


Thus,  we  used  the  assumption  that  the  density  is  constant  in  the  half-spaces  and  that  the 
compressibility  is  range-independent  in  the  half-spaces  to  reproduce  equation  (3.11) 
/(C.35)  with  the  range-independent  “square-root  operator”  suitably  generalized  as  in 


equation  (J.17),  the  X2  -term  gone  (for  now),  and  with  a  new  term  proportional  to 

~[X,{A,y}]: 


H1V2 


=  TjHm+TjkQ 


x  1 3 {xjy 

8  kl  1 6k04 


4k; 


r}] 


+  5th  order. 


Now,  let  us  consider  that  last  term  proportional  to  .  Now: 


4k; 


— >  bifurcate  X  —>  — 


4k; 


and  jA  works  like  a  constant  so  that 
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4k; 


4kl 

i2y 


2  f 


r 

Y,a 


+  2y 


XT  i- 


AM  =  4  y 


KSk20 


A,  A 


Thus,  back  in  equation  (J.16),  we  can  drop  the  term  proportional  to  and 

replace  it  with  the  rescaling 


8  k: 


A,A  ->(1  +  4 y)-2-  A, A 
■  J  V  nSk2i  ■ 


Now,  Sn  A  gets  rescaled  by  the  same  factor.  The  “additional  term”  is  still  zero,  but  there 
is  a  new  term  proportional  to  \jd,A2  J .  However,  this  is  5  th  order,  and  so  we  drop  it. 

Thus,  the  only  change  to  n  1  is  the  rescaling 


tjk0 


r  JP 

V  8koJ 


vK 


f  2  \ 

'<1+4r)8F 

V  8/co  J 


Transformation  S  is  unchanged,  and  so  we  have 

r  X  .  .  sp.Ip 


H“'1=nHm+7ik, 


RI  1  #/'v 0 


- t(1  +  4/)  +  -  4 

8  kly  ’  16  k* 


+  5th  order, 


(J.24) 


J 


with  Hm  given  by  equation  (J.  17).  Dropping  the  tilde  ~  and  superscript  IV. 2  on  TL , 
this  is  just  equation  (5.9). 


Now,  we  add  the  term  proportional  to  A2 .  From  comparison  to  equation  (3.1 1)/(C.35), 
noting  that  we  have  been  careful  to  monitor  our  modifications  to  make  sure  that  nothing 
new  crept  in  (i.e.,  the  A2  -tenn  should  be  the  same  as  in  that  equation),  we  have  the 
following  contribution  proportional  to  A2 : 


VK 


(A2 

v4C 


(J.25) 


Now  making  the  substitution  (J.19),  we  have  the  additional  tenn 


VK 


1 


4  kt 


fc U 

v 


,AA  +  pv 


4k4 

VH/co  J 


(J.26) 


This  is  equation  (5.10). 
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K  Appendix:  A  heuristic  development  of  the  rules  for  s- 
function  bifurcation 

This  appendix  provides  a  plausibility  argument  for  the  8  -function  bifurcation  rules  given 
in  Section  5.2.  Appendix  K.  1  considers  two  cases  where  the  rules  apply: 

■  Subsection  K.  1 . 1  examines  a  flat  interface  where  the  sound  speed  jumps,  but  not 
the  density, 

■  Subsection  K.  1 .2  considers  the  2-fluid  interface  (including  a  density  jump)  in  the 
full  wave  (Helmholtz  equation)  problem. 

Then  Appendix  K.2  discusses  the  limits  of  the  8  -function  bifurcation  prescription. 
Appendix  K.2. 1  examines  two  cases,  where  the  prescription  only  seems  to  work  to  1st 
order: 

■  The  substitution  a=\/  p  is  analyzed  in  Subsection  K.2. 1 . 1  and 

■  Tappert’s  change  of  variable  substitution  u  =  A/ -Jp  is  evaluated  in  Subsection 
K.2. 1.2. 

In  Appendix  Section  K.2. 2,  we  examine  why  some  cases  only  seem  to  work  to  first  order. 
Specially,  the  need  for  undistorted  steps  is  discussed  in  this  section,  and  then  in  Appendix 
Subsection  K.2. 3  our  observations  are  verified  when  the  a  =  1/p  case  is  extended  to  2nd 
order  using  a  Taylor  series  expansion.  Finally,  the  associative  property  for  our 
prescription  is  briefly  examined  in  Appendix  K.3.  It  should  be  noted  that  it  is  implicitly 
assumed  in  this  section  that  the  parameters  jump  at  an  interface,  but  do  not  otherwise 
vary  in  the  vicinity  of  the  interface.  If  this  condition  is  violated,  the  associative  property 
in  choosing  which  distributions  to  bifurcate  may  not  hold,  and  explicit  symmetrization 
may  be  necessary. 

For  convenience,  the  8  -function  bifurcation  rules  given  in  Section  5.2  and  supported  by 
the  arguments  in  this  appendix  are  repeated  here:  The  basic  rule  is  that  the  first  8  - 
function  obtained  splits  in  half  (i.e.,  bifurcates).  The  two  halves  are  displaced  in  opposite 
directions  away  from  the  interface.  Any  other  distributions  in  the  product  collapse  since 
they  are  smooth  functions  in  the  half-spaces  away  from  interface.  The  procedure  is 
associative  in  the  sense  that  it  does  not  matter  which  8  -function  is  chosen  to  be  the  first 
one.  On  the  way  to  generating  that  first  8  -function,  the  chain  rule  for  differentiation 
applies.  The  Vr  operating  on  the  8  -function  will  generate  higher-order  derivatives  of 
the  8  -function.  It  is  pennitted  to  multiply  through  by  a  density  since  it  involves  an 
undistorted  step,  but  in  general  it  is  not  pennitted  to  multiply  through  by  distorted  steps 
produced  by  taking  functions  of  steps. 


K.1  Cases  where  the  8 -function  bifurcation  rules  clearly  apply 

In  this  section,  we  consider  two  cases  where  the  formalism  must  somehow  work,  and 
show  that  the  8  -bifurcation  rules  reproduce  the  known  conect  answers.  Subsection 
K.1.1  considers  the  parabolic  equation  for  a  flat  interface  with  a  sound  speed  jump  only 
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(no  density  jump),  and  Subsection  K.1.2  considers  the  2-fluid  interface  (including  density 
jumps)  for  the  full  Helmholtz  equation. 


K.1.1  Flat  interface  with  no  density  jump  (8p  =  0) 

Consider  an  interface  with  a  sound  speed  jump  (but  a  single  value  of  the  density 
everywhere).  In  Appendix  Subsection  0  we  derive  the  implicit  boundary  conditions  for 
V2A  and  V4A  by  using  standard  manipulations  on  the  Helmholtz  equation.  Later,  in 

Appendix  Subsection  K.1.1 .2,  we  re-derive  these  boundary  conditions  in  the  context  of 
the  parabolic  equation  for  a  flat  interface.  In  this  context,  the  boundary  conditions  for 
V2A  and  V\A  become  explicit.  They  appear  in  cross-terms  where  derivatives  operate 

on  step  functions  to  produce  8  -functions,  which  in  turn  are  converted  to  boundary 
conditions  using  transverse  integration.  8  -function  bifurcation  must  be  used  to  obtain 
some  of  these  boundary  conditions.  Note  that  when  the  interface  is  flat  (i.e.,  range- 
independent),  there  is  no  backscatter  and  the  oo-order  parabolic  equation  is  exact. 
Therefore,  the  precise  boundary  conditions  obtained  for  the  Helmholtz  equation  in 
Appendix  Subsection  K.  1 . 1 . 1  should  apply.  Since  the  orders  decouple  (see  the  discussion 
at  the  beginning  of  K.  1 . 1 .2;  higher  orders  add  new  explicit  boundary  conditions  but  do 
not  modify  existing  lower-order  boundary  conditions),  this  agreement  should  hold  even 
for  finite-order  parabolic  equations.  Therefore,  agreement  between  the  results  obtained 
using  8  -function  bifurcation  (Appendix  Subsection  K.  1.1.2)  and  those  obtained  using 
conventional  methods  (Appendix  Subsection  K.  1.1.1)  establishes  a  context  in  which  the 
8  -function  bifurcation  prescription  is  indeed  valid. 

K.1.1. 1  Reality  check:  The  implicit  boundary  conditions  on  V2TA  and 
V4A  obtained  directly  from  the  Helmholtz  equation 

To  serve  as  a  point  of  comparison  with  the  parabolic  equation  for  a  flat  interface  with  a 
sound  speed  jump  (but  a  single  value  of  the  density  everywhere);  let  us  begin  by  first 
considering  the  corresponding  Helmholtz  equation.  This  is  a  2nd  order  partial  differential 
equation 

(V2+k2)A  =  0  (K.l) 

with  boundary  conditions  A,h  ■  VA  constant  (as  always,  ft  is  the  normal  to  the  interface). 
The  boundary  conditions  on  higher-order  derivatives  of  A  do  not  need  to  be  explicitly 
spelled  out,  because  a  2nd  order  differential  equation  with  2  boundary  conditions  fully 
determines  the  problem. 

Now  consider  the  simplest  type  of  interface:  the  sound  speed  discontinuity  (the  density  is 
the  same  everywhere)  is  now  assumed  to  occur  along  a  flat  range-independent  interface. 
The  basic  geometry  is  shown  in  Figure  K.l  below. 
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Region/ 

Region// 


Flat  interface 


f  Rt 


>x 


Figure  K.1  -  The  basic  geometry  for  a  flat  range-independent  interface,  where  the  sound  speed 
jumps,  but  not  the  density.  When  we  consider  the  parabolic  equation,  we  will  simplify  things  yet 

again,  and  let  the  2-dimensional  transverse  vector  RT  =  ( y,z)  become  the  1-dimensional  variable 
Z  (the  depth).  For  the  moment,  as  we  consider  the  full  Helmholtz  equation,  we  will  keep  the  2- 
dimensional  vector  RT  =  ( y,z)  and  its  corresponding  gradient  VT ,  but  keep  in  mind  that  although 

results  (K.3)  and  (K.8)  below  will  in  principle  apply  when  the  interface  is  a  function  of  the  transverse 
coordinate  y ,  it  is  the  1-dimensional  result  that  will  be  of  interest  once  we  move  on  to  Subsection 
K.l. 1.2. 


Now,  the  boundary  conditions  are  continuity  of  A  and  V A  .  The  continuity  of 

components  of  VA  in  the  tangential  direction  comes  as  a  bonus  from  the  continuity  of 
A  .  Recall  the  discussion  leading  to  equation  (3.5 1)  in  Subsection  3.3.2  for  a  derivation  of 
this  result.  There,  it  was  stated  that  the  continuity  of  a  given  function  implies  that  the 
difference  between  the  values  of  the  function  evaluated  on  the  two  sides  of  the  interface 
is  zero,  which  in  turn  implies  that  the  tangential  derivative  of  the  difference  is  also  zero. 
Note  that  we  can  iterate  this  argument,  beginning  with  the  continuity  of  the  derivative  in 
the  tangential  direction,  to  show  that  a  succession  of  two  or  more  derivatives  all  in  a 
tangential  direction  is  also  continuous.  Specifically,  with  x  in  a  tangential  direction  and 
A  continuous,  we  have  d"A/dxn  continuous  at  the  interface  for  all  n  . 


Now,  let  us  examine  some  of  the  implicit  boundary  conditions  on  the  higher-order 
transverse  derivatives  V'" .  As  discussed  just  below  Figure  3.8  in  Section  3.3.2,  the 
boundary  conditions  on  higher-order  derivatives  become  an  issue  in  the  higher-order 
parabolic  equation.  Then,  the  boundary  conditions  on  higher-order  transverse  derivatives 
can  no  longer  remain  implicit. 


The  Helmholtz  equation  gives 

V2A  =  ^  d 

V  TSlj 


A 


-k 

->  2  K1 
OX 


A, 


V  A  = 

V  TfLn 


v  nj 


A, 


just  inside  Region  / 


just  inside  Region  II 


(K.2) 


Taking  the  difference  between  the  two  equations  and  using  continuity  of  A  and 
d2  A/ dx1  ,  we  have  the  implicit  boundary  condition  on  V2tA  : 

A  (V2TA)  =  -(Sk2)A  (K.3) 


where 
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A  ( V^)  =  V2tAu  —  V1TAl 
Sk2  =  k2n  -k2 


(K.4) 


Next,  let  us  obtain  the  boundary  condition  on  V4A .  Take  V2r  of  both  parts  of  equation 

(K.2): 
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and  use  (K.2)  to  substitute  for  V2A : 
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Now,  subtract  these  two  equations,  recalling  that  d4  Ajdx4  and  d2  A /dx2  are  continuous. 
This  leaves  us  with 

d2A 


V4tAu  -  V44  =  (k4n  -k4)A  +  2(k2I- k] ) 


dx 


2  * 


With  Sk2  as  defined  in  equation  (K.4)  above  and  k2  =  (k/  +  k2u  )/2  ,  this  becomes 

d2A 


V4tAu  -  V44  =  2 k2  (Sk2  )A  +  2  (Sk2 ) 


dx2 


=  -2  (Sk2) 


-^A-0 

dx" 


(K.6) 


Adding  the  two  parts  of  equation  (K.2)  and  dividing  by  2,  we  get 


r  2  dx 


2  ? 


(K.7) 


and  so 


A(V4A)  =  V4rAI7  -V4rA7  =  -2(Sk2)V2A . 


(K.8) 


Thus,  equations  (K.3)  and  (K.8)  give  the  implicit  boundary  conditions  on  V2A  and  V4A 
respectively.  Note  that  as  we  compare  results  (K.3)  and  (K.8)  to  the  corresponding 
parabolic  equation  result  in  the  next  subsection,  we  will  go  to  the  1 -dimensional  problem 
where  Vr  — >  Vr  =  d/dz  . 
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K.l.1.2  The  same  boundary  conditions  on  V\A  and  V4rA  obtained  from 
the  parabolic  equation  using  S  -function  bifurcation 

Recall  that  we  are  considering  the  range-independent  constant  density  problem  so  that,  as 
discussed  in  Appendix  C.2.2  (especially  the  discussion  surrounding  equation  (C.16)),  the 
parabolic  equation  can  be  used  to  propagate  the  acoustic  field  A  itself,  and  not  just  the 
auxiliary  field  X  •  Furthermore,  the  “exact”  scalar  Hamiltonian  H  is  the  infinite  order 

expansion  in  2 A/k0  of  koy]l  +  2A/k0  (keeping  in  mind  the  usual  caveats  concerning  the 
assumption  that  the  operator  A  is  in  some  sense  small).  It  follows  immediately  that  at 
any  finite  order,  the  commutator  [A,//]  =  0  ,  and  so  A  is  an  observable.  Therefore,  we 

can  always  in  principle  expand  the  solution  A  into  eigenvectors  and  replace  A  with 
eigenvalues.  It  follows  that  for  any  given  power  n  ,  S  -functions  generated  internally  to 
A"  must  cancel,  and  the  orders  (in  n  )  must  decouple.  Therefore,  higher  orders  can  only 
add  new  boundary  conditions,  but  they  cannot  modify  the  old  ones  (when  one  uses  the 
transverse  integration  technique  outlined  in  Section  3.3.3  to  convert  the  contact  potentials 
to  boundary  conditions).  Specifically,  when  we  add  a  new  order  A " ,  only  the  first  two 
integrations  count,  and  these  will  provide  the  boundary  conditions  on  the  two  new 
derivatives  that  now  need  specified  boundary  conditions:  V2"^1  A  and  V2"~2  A  .  (The  new 
explicit  boundary  conditions  become  required  in  the  context  of  the  new  Hamiltonian 
where  lead  order  is  now  V2"A  rather  than  just  V2f2A  .)  For  our  current  purposes,  we 
will  only  consider  the  boundary  conditions  on  V2TA  and  V\A ,  and  use  A2  theory  and  A3 
theory  respectively  to  derive  them.  We  will  use  S  -function  bifurcation  on  distributions 
generated  by  cross-tenns  in  these  products  of  differential  operators.  The  fact  that  the 
correct  boundary  conditions  are  indeed  generated  will  contribute  to  our  heuristic 
validation  for  the  S  -function  bifurcation  technique. 

The  contact  potentials  (e.g.,  potentials  proportional  to  S  or  S')  generated  by  cross-terms 
in  A"  play  a  crucial  role.  For  example,  in  A2  theory,  a  S'  -contact  potential  is  needed  if 
we  are  to  avoid  having  our  transverse  integration  incorrectly  predict  that  V2A  is 
continuous  (i.e.,  A(V2Aj  =  0 ).  Continuing  on  up  to  A'  theory,  if  we  were  to  throw  out 

all  contact  potentials,  then  we  would  incorrectly  conclude  =  0  .  This,  of  course, 

contrasts  with  the  discussion  just  above  in  Section  0  concerning  the  Helmholtz  equation. 
In  this  equation,  no  derivatives  of  higher  than  2nd  order  appear  explicitly,  and  so  the 
additional  boundary  conditions  involving  discontinuity  of  the  second  and  fourth 
tangential  derivatives  of  the  wave  function  were  extracted  from  the  Helmholtz  equation 
only  indirectly,  and  no  contact  potentials  were  needed  in  the  Helmholtz  equation  itself. 
(However,  they  effectively  appeared  when  we  took  derivatives  of  the  Helmholtz 
equation.) 
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From  now  until  the  end  of  this  subsection  (Appendix  K.  1 . 1 .2),  we  will  revert  to  the  2- 
dimensional  problem,  where  x  is  the  range  and  z  is  the  depth.  Thus,  zz  replaces 
Rt  =  ( v,z)  and  Vr  and  d/dz  are  the  same.  As  always,  the  positive  z-axis  points  up 

from  Region  II  into  Region  /  (e.g.,  see  Figure  K.l  with  RT  — >  zz).  We  follow  the 
notation  first  outlined  back  in  Section  2.1,  noting  especially  that  equation  (2.5)  implies 
that  //  =  ( 1  -  n 2  )/2  and  k0n  =  k  .  Then,  choosing  the  reference  wave  number  k0=kI,  it 

follows  that  jU  =  &(-z)jUh  and 

2 klfl  =  2 kgjUn&(-z)  =  kl  (l -%)©(-z)  =  [k]  - k27)0(-z)  =  -((%2)0(-z) . 

Note  that  Sk2  =  k2n  -  k]  as  in  equation  (K.4),  and  0  is  as  always  the  Heaviside  step 
function  given  by  equation  (3.22).  All  this  gives  us 

2 k0  A  =  V2r  +  (Sk2)@(-z).  (K.9) 

Irrelevant 

overall 

factor! 

Note  the  A  theory  (i.e.,  1st  order)  instantly  gives  us  continuity  of  A  and  V rA  (using 
infinitesimal  transverse  integration). 

Now  let  us  look  at  A2  theory  and  deduce  the  boundary  condition  on  V]  A  .  Start  with 
(2 k0Af  A  =  [v2T  +  {5k2)®{-z))[v2T  +  (8k2)®(-z))A(z) 

=  V4rA(z)  +  V2  [dk2  ■  0(-z)H(z))  .  (K.  10) 

Expand  this  term 

+  Sk2  ■  0(-z) V2h|  +(^k2)  02(-z)H(z) 

These  terms  play  no  role  in  determining 
the  boundary  condition  on  V\A 


The  last  two  terms  in  equation  (K.10)  do  not  matter  in  the  present  context.  This  is  so 
because  V2  A  has  at  worst  a  step  function  and  A  is  continuous,  and  neither  has  the 
requisite  S'  -  (or  8  -)  function  for  actually  generating  a  discontinuity  boundary  condition 
in  V2A  (or  V]A )  via  the  infinitesimal-transverse-integral  procedure  outlined  in  Section 
3.3.3.  In  fact,  if  we  integrate  either  of  these  terms  over  an  infinitesimal  transverse 
integral,  we  will  get  zero. 

Expanding  the  interesting  term  gives 

V2  (Sk2  ■0(-z)H(z))  =  V7,  [-c)k2  ■  S(z)  A(z)  +  Sk2  ■  0(-z) VrH|  ] 

=  —Sk 2  ■ S'(z)A(z)-Sk 2  ■  S(z)VtA\ 

-Sk2  ■  S(z) VrA\_  +  Sk 2  ■  0(-z) v2h|  . 

Once  again  there  are  no  c)'s 
here  and  this  gives  0  after 
infinitesimal  integration 
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Recalling  equation  (3.54)  (for  the  flat  interface  z  =  /  =  0),  we  have 

5{z)x{z)  =  S{z)x{  0) 

^'{z)z{z)  =  S'(z)z{0)-S(z)z'{0), 

and  so 

V2T(Sk2 -Q{-z)A(z))^-Sk2  ■S'{z)A(0)-Sk2  ■S(z)VrA[]+Sk2 -&{-z)V2ta\  .  (K.ll) 

Stays  finite  at  the  origin 


Substituting  into  (K.10),  this  gives  us 
(2k0A)2  A  =  (V2T  +  (Sk2)@(-z))(v2T  +(Sk2)@(-z))  A(z) 

=  V4TA(z)-Sk2  -S'(z)A(0)-Sk2  -S(z)VtA\o  (K.  12) 

+2  8k2  •©(— z)V^|  +(Sk2f  02  {-z)A(z)  . 

Finite  stuff  which 
integrates  to  zero  for 
infinitesimal  intervals 


Now  perform  the  infinitesimal-transverse-integral  procedure  outlined  in  Section  3.3.3, 
noting  the  geometry  in  Figure  K.2  below. 


II 


I 


>  z  + 


Interface 
atz  =  0 


Figure  K.2  -  For  the  infinitesimal  integration,  recall  that  the  interface  is  along  the  z  =  0  line  and 
that  the  positive  z  -axis  points  from  Region  II  into  Region  /  . 

£  Z 

Following  equation  (3.53),  we  perform  the  integration  J  dz  J  dz  (with  £  — >  0  )  on 

-£  -oo 

equation  (K.12)  to  obtain 

V2TAI-V2TAI1-Sk2-A(  0)  =  0 


or 

V2A//-V2A/=A(V2A)  =  -Sk2-A(0),  (K.13) 

which  is  exactly  equation  (K.3).  Note  that  we  needed  the  S'  -function  from  the  cross¬ 
terms  in  Xr  to  get  this  result.  Incidentally,  integrating  (K.12)  once  also  gives  us  the 
similar  result 
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\(v\A)  =  -8k2  -\tA\o. 


It  is  now  possible  to  see  where  we  are  heading  with  this.  On  the  one  hand,  we  need  the 
cross-term  between  the  differential  operator  V2  and  step  function  0  to  generate  the 
proper  boundary  conditions  on  V2TA.  On  the  other  hand,  for  higher-order  A"  (where 


n  >  2  )  we  will  rapidly  begin  to  pick  up  products  of  distributions:  first  the  product 
QS  appears  and  if  we  do  nothing  about  that,  then  the  cross-terms  will  eventually  generate 
products  of  S  -functions.  Note  that  all  this  is  happening  in  a  trivial  range-independent 
problem  where  the  density  is  constant,  and  everything  should  work  out!  The  bottom  line 
is  this:  if  the  first  time  we  create  a  S  -function,  we  split  it  into  a  pair  of  half  S  -functions: 


„  SH  S1  8  8+ 

8  —  +  —  =  —  +  — 

2  2  2  2 


(K.  14) 


then  we  avoid  problems  associated  with  the  multiplication  of  distributions.  Very 
significantly,  we  will  also  get  boundary  conditions  that  are  fully  consistent  with  the  wave 
equation. 

With  this  in  mind,  let  us  examine  the  A3  -term  and  use  the  8  -function  bifurcation  rules  to 
reproduce  the  boundary  condition  on  V\A  given  by  equation  (K.8).  We  have 

{2k0A)'  A  =  (W2T  +  Sk2  ■  0(-z))(V2  +  8k 2  •  0(-z)) (v2  +  8k 2  -®{-z))A  .  (K.  15) 

Substitute  for  this 


Next,  using  equation  (K.  12)  evaluate  the  two  right-hand  appearances  of  the  operator 
( V2  +  8k 2  ■  0(-z))  first,  and  then  operate  on  the  result  with  the  left-most  appearance  of 

the  operator  (V2  +  Sk 2  ■  0(-z)) zzz.  This  gives  us 


222  Since  the  evaluation  of  this  expression  will  eventually  require  use  of  the  new  procedure  of  8  -function 
bifurcation,  it  is  fair  to  question  whether  evaluating  this  expression  by  expanding  it  using  the  product  rule 
for  differentiation  first  and  only  then  replacing  derivatives  of  the  step  function  with  8  -functions  is  really 
equivalent  to  expanding  the  two  right-hand  internal  operators  first,  replacing  them  with  8  -functions,  and 
only  after  that  applying  the  left-most  operator.  In  other  words,  does  the  associative  property  for  the 
multiplication  of  distributions  apply  even  when  8  -function  bifurcation  is  involved?  In  Section  K.3  this 
issue  will  be  addressed,  and  it  will  be  argued  that  the  associative  property  for  multiplication  applies,  even 
when  8  -function  bifurcation  is  involved. 


241 


(2k0Xf  A  =  (V2t+  8k 2  •  ®(-z))  ■ 

~Wata\  -8k2- S'(z)A(0)-8k2- 8(z)VtA\0 
+28k2  -0(-z)V^|_  +(Sk2f  ®(-z)®(-z)A(z) 

=  V6rA  |  +2  8k2 -V2  (©(-z)V^|  )  +  (8k2f  V2(@(-z)Q(-z)A(z))  _ 

-Sk2-S'"{z)A(0)-8k2  ■S"(z)87TA[+Sk2 -®(-z)VlA\ 

+2 (8k2)  -0(-z)®(-z)V^|  +  (<5k2)  0(-z)0(-z)0(-z)^4(z) 

—  ( 8k 2)  ®(-z)-8'(z)A(0)-(8k2^  -0(-z)- 8 (z)V tA\q. 

<T/2  S~/ 2 


Since  these  are  the  only  ones  that  contribute  as  we  take  a  double  integral  to  obtain  the 
boundary  condition  on  V4TA ,  we  will  only  consider  terms  that  pick  up  a  8'  function: 

\I\  =  28k2-V2T  (0(-z)V2^|  ) 

@ = (£k2)V;  (0(-z)0(-z)4z)) 

[3]  =  8k2  ■  0(-z)  V^|  •  (K>17) 

[4 ]  =  -(<5k2)  0(-z)  ■  8'(z)  A(0) 

~SA2 

Thus, 

{2k0Xf  A  =  V6TA\_  +  @  +  [2]  +  [3]  +  @  +  non-(T  terms  .  (K.  1 8) 


Now, 


[T|  =  28k2  ■  V2  ^0(-z)  V2yf|_) 


=  28k2  ■  Vr 


-8(z)V2A  +  @(-z)V3A 


use  ^-function 
bifurcation 


Follow  the  8  -function  bifurcation  prescription:  8  —>  8‘ / 2  +  8n / 2  =  S+ / 2  +  8  / 2 : 

8(z)V2A  =  — VLd  +  — V2dJ  . 

\  j  1  2  1  1  lo  2  1  11 1° 
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£ 

Under  the  relevant  infinitesimal  integration  J  dz ,  ^+/2j  +  [<^~/2]  VyT7/| 

-£ 

integrates  to  (VyT7|o  +  V7d7/  |(J  jl  =  V2TA  .  Thus,  as  long  as  we  are  clear  about  how  we 
got  there,  we  can  use  the  shorthand 


S(z)V2tA  =  —VIA,  +—V2tAu  n  <->  8(z)V2rA. 


Always  exercising  caution,  it  will  often  prove  convenient  to  proceed  in  this  way  and 
recombine  are  bifurcated  S  -functions  into  an  ordinary  8  -functions  times  averages.  We 
will  move  freely  from  one  form  to  the  other  as  needed. 


Thus, 


0  =  2 8k2 -V, 


-8(z)V2A  +®{-z)V\A\ 


=  - 28k 2  ■  8'(z)V2A  +  2Sk2 -8{z)V\A  +2 8k2 -0(-z)V4rA  . 


Again,  8 (z)  is  really  ^[^+/2]  =  [^+/2]  +  [^  /2]  ,  and  so 


8(z)V3TA\  =  [<T/2] V\A, |  +  [8~/2]  V\An  |  <->  8  ■  V3rA 


and  we  have 


0  =  -28k2  ■  8'(z)V2tA  +2 8k2-8{z)V\A  +2 8k2  -0(-z)V4A  .  (K.19) 


This  term  is  not  c*  S',  so 
it  will  not  survive  the  double 

£  Z 

integral  j*  dz  j*  dz 


Now  recall  the  argument  at  the  beginning  of  this  subsection  (Subsection  K.  1 . 1 .2).  Range 
independence  implies  that  the  distributions  internal  to  A2 A  must  cancel.  Equation  (K.  12) 
therefore  implies  that 

V4tA  =  8k2  ■  8'(z)A(0)  +  terms  that  are  not  °c  8'. 


In  expression  0 ,  this  is  multiplied  by  the  step  function 

0  z  >  0  (Region  I) 


0(-z) 


1  z  <  0  (Region//)  ’ 


so  that 


e(-z)<r(z)  =  gAl£2£)+gAl£h£l = 


(K.20) 
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Noting  that  A  is  continuous  even  at  z  =  0 ,  we  have 

,  .  .  i  „  ,  ,  .  „ . ,  .  .  .  f  terms  that  are 

®(-z)V4ta\  =  Sk2-0(-z)S  (z)A(  0)  + 

lz  [  not  °c  o 

„  ,  S'  (z)  ,  .  [ terms  that  are  1 

=  ^.^(0)+  not^,  } 


(K.21) 


Again,  at  this  point  we  might  as  well  drop  the  superscript  “  —  ”  from  the  S'  -function. 
Note  the  S  -function-bifurcation  prescription  at  work  here:  it  has  generated  a  factor  of  y2 
that  we  would  otherwise  have  missed! 


Combining  (K.19)  and  (K.21),  we  have 


=  —2Sk°  ■  S'(z) V\A  +[dk2\  -S'(z)A( 0) 

0  v  7 

[ tenns  that  will  not  contribute  1 
to  the  double  integration  j 


+ 


(K.22) 


Next,  let  us  evaluate  [2] .  This  will  be  a  great  test  for  the  S  -function-bifurcation 
prescription. 


@  =  (^2)2V2r(0(-z)0(-z)^l(z)) 


=  (Sk2^S7T 


^[Vr0(-z)]0(-z)^(z)  +  0(-z)[Vr0(-z)]^(z)^  • 
v+©(-z)  0(-z)  VtA  (z) 


Now  use  the  prescription 


vre(-z)=-A_fL 


and  equation  (K.20)  to  get 

[vr@(-z)]e(-z) = e(-z)[vr@(-z)] = 0  -  y = 

Again,  since  A  is  continuous  and  we  will  integrate,  we  can  drop  the  superscript  “  -  ”  to 
get 
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[H=(«!)Jvr[-|^(o)-|j(o)^ 

+  (Sk2^~  \_V T®(-z)\®(-z)V TA(z)  +  (Sk2^  ®(-z)\_V T®(-z)\V tA(z) 

Again  Again 

+  (c%2)20(-z)0(-z)V2T(z)  (K.23) 

=  -(^2)2  ^(z)-^(O)-(^2)2^(z)Vr^(O)  +  (^A:2)20(-z)0(-z)V2^(z) 

These  terms  go  out  after  double  integration 


=  -(#k2f  5\z)A{  0)  + 


[  terms  that  will  not  contribute 
;  to  the  double  integration 


Note  that  \l\  and  the  2nd  term  in  |T|  will  cancel. 


Next  we  can  use  equation  (K.21)  to  obtain 

H  =  ^2-0(-z)V^[  =  (^2)2 


f  terms  that  are 
|  not  oc  S' 


(K.24) 


Now  consider 

[4]  =  -(^A'2)  Q(-z)  - S'(z)  A( 0) . 

S'-/2 


As  already  noted  above.  S'  splits: 

S\z )“> 


S'+(z)  |  S'-(z) 
2  2 


and  0(-z)  is  given  by  equation  (K.20)  so  that 


0(-z) 

0(-z) 


2 


2 

=  0 


and 


0(— z)'^(z)  = 

Again  dropping  labels  since  A  is  constant  and  Sk2  is  just  a  constant  parameter,  we 
simply  pick  up  a  factor  of  x/2  and  end  up  with 
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(K.25) 


H=-(^2)2 


8\z) 

2 


A(  0). 


Note  that  [3]  and  [4]  cancel. 


Combining  (K.18),  (K.22),  (K.23),  (K.24)  and  (K.25),  we  have 

/  .  \  3  f-  1  1 — 1  i — 1  1 — 1  1 — 1  f  terms  that  are 

(2  k,XfA  =  WU\  +0  +  @  +  @  +  i+  + 

lz  [  not  oc  d 


=  V6tA  -2 Sk2S'(z)V2A 


f  terms  that  are 
0  [  not  oc  S' 


Now,  substitute  into  the  A3  -parabolic  equation  corresponding  to  the  Helmholtz  equation 
(e.g.,  see  equation  (C.35)  or  equation  (5.7)  or  (J.15)  with  y =  0 ).  As  noted  at  the 
beginning  of  this  subsection,  since  this  particular  problem  is  range-independent,  we  can 
replace  the  auxiliary  field  %  with  the  acoustic  field  A  : 


A2  A3 


k.,  +  A - f- 


2  K  2k: 


A. 


This  gives  us 


0  =  -2 Sk2 -S'(z)V2tA 


+ 


non-fi'  terms 


or 


S'  terms  internal  to  A2  that 
must  cancel  each  other  out 


(K.26) 


£  z 

Integrating  twice  (i.e.,  }  dz'  }  dz---),  we  obtain 

—£  —00 

0  =  V4tA,  -  V4tAu  -  2  Sk2  ■  V2A 


or  with  A  (V*A)  =  V^Afl  -  V4A7 ,  we  get 

A (VrA)  =  ~25k2  •  VyA  , 


(K.27) 


which  is  exactly  equation  (K.8),  the  implicit  boundary  condition  for  the  full  wave 
equation. 

Thus,  by  requiring  that  as  the  PE  adds  higher-order  powers  of  the  transverse  Laplacian 
V^, ,  the  boundary  conditions  at  a  flat  horizontal  (i.e.,  range-mdependent)  interface  remain 
consistent  with  those  set  by  the  wave  equation,  we  find  that  the  physics  clearly  forces  a 
specific  extension  to  distribution  theory:  The  S  -functions  on  the  interface  must 
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“bifurcate”.  Next,  in  Subsection  K.  1 .2  we  will  show  that  the  same  extension  to 
distribution  theory  is  also  necessary  to  recover  the  2-fluid  boundary  conditions  at  an 

interface  where  the  density  p  and  the  sound  speed  parameter  //  =  ( I  -  jj/ )  ^2  both  jump 

(i.e.,  p  and  p  are  functions  of  the  coordinates  that  contain  step  functions  at  the 
discontinuity). 


K.1.2  2-fluid  full  wave  interface  (Sp* 0) 

In  this  subsection,  we  demonstrate  that  the  S  -function-bifurcation  rules  reproduce  the 
boundary  conditions  on  a  2-fluid  interface  embedded  in  2-dimensional  x-z  space. 
Once  again,  let  us  set  up  the  same  (local)  coordinate  system  we  typically  use  in  such 
circumstances  (see  Figure  K.3  below).  In  particular,  note  that  in  the  local  coordinate 
system,  the  interface  is  at  z  =  0  and  the  z  -axis  is  (locally)  normal  to  the  interface. 


P 


ii 


x  is  the  direction 
tangential  to  the  surface 


Interface 

;  Pi 


Ydp 


II 


I 


- 1 - ► 

Interface 


z+:  the  positive 
z-axis  points 
from  Region  II 
into  Region  I 


Figure  K.3  -  The  geometry  used  in  this  section.  The  z  -axis  is  locally  normal  to  the  interface  and  the 
X  -axis  is  locally  tangent. 


The  density  jump  is  given  by  8p  =  pn  -  p}  and 

P  =  Pu-8p-®{z)  =  \Pl 

{ Pn 


z  >  0 
z  <  0 


The  field  A  obeys  the  variable  density  acoustic  equation  (equations  (2.1)  and  (5.1)) 


d_ 

dz 


-—A 
p  dz 


J 


d 

+  — 
dx 


-—A 
p  dx 


2  2 


+  - 


k:n 


A  =  0. 


P 


(K.28) 


Tangent  to  the  surface;  no 
distributions  generated  here 
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Note  that  the  x -direction  is  tangential  to  the  interface,  and  so  p,n,A  are  all  locally 
continuous  with  respect  to  this  variable.  A  translation  in  the  x  -direction  encounters  no 
jumps  in  the  density,  and  so  no  distributions  are  generated  by  the  d/dx  derivative.  This 
tenn  is  therefore  not  a  player  in  detennining  the  boundary  conditions  via  transverse 
integration. 


Now,  let  us  proceed  to  step  one  of  our  prescription:  until  we  generate  a  8  -function,  treat 
p  just  like  any  other  variable  (i.e.,  use  the  chain  rule  for  differentiation  and  multiply, 
etc.).  The  usual  rules  for  differentiation  give  us 


dz 


±*-A 

p  dz 


1  dp  dA  1  d2A 
p1  dz  dz  p  dz1 


(K.29) 


Substituting  (K.29)  into  the  wave  equation  (K.28)  and  multiplying  through  by  p ,  we  get 

d2A  1  dp  dA  d2A 


dz2  p  dz  dz 


-  + 


.,  +  k0n~A  —  0  . 
ox 


(K.30) 


Now,  naively  we  have 


dp 

dz 


=  —dp  ■  8  (z) 


where  8p  is  a  fixed  “density  jump”  parameter. 


Now  we  are  ready  to  move  to  step  two  of  our  prescription  and  bifurcate  8  : 


or 


|)=-f.<r(z)-f.«r(z) 


Substituting  into  the  modified  Helmholtz  equation  (K.30),  this  leads  to 


d2A  1 

-  +  - 


Sp  dA, 


dz2  p /  2  dz 


<T(z)  + 

0  Pll 


1  Sp  dAn 


2  dz 


8-{z)  + 


d  A  -2i,  ,, 

— — r  +  kQn  A  =  0 
ox 


£ 

Integrating  once  across  the  interface  (over  the  infinitesimal  interval:  J  dz  •  •  • ),  we  have 

-£ 
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lo 


or 


dAj 

dAn 

1 

_| _ 

dz 

dz 

1 

P, 

dAj 

dAn 

1 

-I _ 

dz 

dz 

l 

2 

1  Sp  dAj 


2  dz 


+ 


1  Sp  dAh 


pu  2  dz 


(A-( 

dAI 

1 

+  — 

r,-A) 

dAn 

V  Pi  J 

dz 

o  2 

v  Pii  J 

dz 

0  =  1 
2 


Rjl  +  i 

\PI  , 


dA, 


dz 


1 

+  — 
2 


f 


-i-A" 

Pii  J 


dA, 


dz 


f  Pn+Pi  ' 

dAj 

f  Pn+P 

,  > 
1_ 

dAn 

l  Pi  J 

dz 

0 

l 

Pu 

J 

dz  0 

1 

dAj 

1 

dAn 

Pi 

dz 

0 

Pii 

dz 

0 

(K.31) 


Similarly,  two  integrations  give  us  continuity  of  the  field  A  .  Thus,  the  S  -function- 
bifurcation  prescription  correctly  reproduces  the  boundary  conditions  on  the  field  at  an 
interface  where  the  density  jumps:  continuity  of  the  field  and  of  the  nonnal  derivative 
divided  by  the  density. 


K.2  An  important  caveat  concerning  the  S -function  bifurcation 
procedure 


As  noted  in  the  discussion  in  Section  5.2,  thec>  -function  bifurcation  prescription  only 
applies  in  certain  specified  contexts,  and  it  must  be  applied  with  care.  The  prescription 
works  well  when  the  associated  step  functions  are  raised  to  simple  powers,  and  when  it  is 
used  in  conjunction  with  the  ordinary  rules  of  differentiation  such  as  the  chain  rule  and 
the  product  rule.  However,  naively  multiplying  through  by  functions  of  the  step  function 
will  cause  problems.  For  example,  in  Subsection  K.2.1,  we  will  see  that  multiplications 


such  as  [l/(l  +  constant  •  0)]  ■  S  or  l/  R\  +  constant  ■  0  ■  8  will  produce  incorrect  results 


when  combined  with  the  bifurcation  prescription  (at  least  if  both  sides  are  weighted 
equally).  As  discussed  in  Subsection  K.2. 2,  these  functions  apparently  distort  the  step 
and  change  the  relative  weighting  factor  assigned  to  the  two  halves  of  the  bifurcated  S  - 
function.  On  the  other  hand,  as  will  be  demonstrated  in  Subsections  K.2. 1  and  K.2. 3,  we 
can  expand  these  functions  into  Taylor  series  expansions,  and  these  consist  of  sums  of 
terms  that  are  products  of  the  basic  fonn  0"  ■  8 .  Now,  for  these  clean  undistorted  steps 
we  can  use  the  8  -function  bifurcation  prescription. 
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K.2.1  Cases  where  8  -function  bifurcation  only  seems  to  work  to  1st 
order 

Here  we  examine  two  cases,  where  the  8  -function-bifurcation  prescription  only  works  to 
to  1st  order:  the  substitution  a=l/ p  and  Tappert’s  change  of  variable  (COV) 

substitution  u  =  Aj \Jp  . 


K.2.1.1  2-fluid  full  wave  interface  with  the  substitution  a  =  \/ p 

As  in  Appendix  Subsection  K.1.2,  let  us  consider  a  2-fluid  interface  embedded  in  a  2- 
dimensional  x  —  z  space.  Now,  instead  of  using  the  variable  p  ,  let  us  consider  the 
variable  a  =  1/p  .  As  shown  in  Figure  K.4,  a  jumps  at  the  interface,  and  so  it  can  be 
expressed  as  a  =  an-Sa-®(z)  where  8a  =  au  - aI .  Note  that  once  again  we  use  a 

local  coordinate  system  so  that  the  interface  is  at  z  =  0  and  the  z  -axis  is  (locally)  normal 
to  the  interface. 


a 


IT 


x  is  the  direction 
tangential  to  the  surface 


II 


Interface 

:  aT 

I  I 


r  8a 


I 


;  z+:  the  positive 

1 _  z-axis  points 

from  Region  II 
Interface  into  Region  I 


Figure  K.4  -  The  geometry  used  in  this  section.  The  z-axis  is  locally  normal  to  the  interface.  We  use 
the  new  variable  a  =  \j  p  ,  and  a  =  CCn  —  8a-  0(z)  where  8a  =  an  —  a: . 


The  variable  density  acoustic  wave  equation  (K.28)  now  takes  the  form 

0  =  Vr  +  •  •  • 

where  the  ellipsis  stands  for  terms  that  will  not  a  play  a  role  in  determining  the  boundary 
conditions  on  the  field  A  .  By  integrating  directly,  we  see  the  boundary  conditions  are 

dA 

^continuous  ;  a  ■— continuous .  (K.32) 

dz 
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Now,  let  us  show  that  to  1 st  order  in  8a ,  8  -function  bifurcation  reproduces  the  boundary 
conditions  (K.32).  The  continuity  of  A  is  trivial,  because  there  is  only  a  first  derivative 
of  a ,  and  so  no  S'  will  be  generated.  It  remains  to  derive  the  continuity  of  a  ■  dA/dz  . 

The  chain  rule  for  differentiation  gives  us 

0  =  {VTa)  {VtA)  +  gcV2tA  +  ■■■. 


Now,  8  -function  bifurcation  gives  us 

VTa  =  -SajVrA  =  -8a  * 


8+  8  x 

—  VrA++ — vta 

V  2  2  , 


and  the  wave  equation  becomes 


f  v+ 


0  =  aV2TA  -  8a 


—vta+  +  —vta 

V  2  2  , 


+  • 


(K.33) 


Now  comes  the  crucial  (and  as  it  turns  out  partly  illegal)  step:  divide  by 
a  =  an  -  8a  ■  0(z) .  Note  that  in  Subsection  K.  1 .2  above,  we  multiplied  by 

pn  -  Sp  ■  0(z) .  In  other  words,  earlier  we  multiplied  by  the  step  function,  but  now  we 

multiply  by  the  step  function  0  embedded  in  another  function;  in  this  case  the  other 
function  has  the  basic  form  l/[l  +  constant  ■  0] . 


At  any  rate,  dividing  equation  (K.33)  by  a  gives  us 


0  =  VrA- 


f  8+  8~  A 

- V? ,A,  H - V tA,j 

v2  at  2a n  y 


Sa+  - . 


Next,  we  perfonn  the  infinitesimal  integration  J  dz  and  recall  that  Vr  =  c)/c)z  for  the  2- 


dimensional  problem.  This  gives  us 


V T A,  -  V T A, ,  =  —VtA,  +  —  VtA„ 
T  1  T  11  2  a,  T  1  2a u 


and  so 


VrA, 


/  ]  8a  ^ 

= 

f  8a  \ 
1  + - 

v  2a j  j 

v  2  or u  j 

VrAn . 


(K.34) 


Result  (K.34)  would  produce  a  mess  if  we  continued  without  imposing  a  trick:  let  us  use 
a  1st  order  Taylor  series  expansion  to  rewrite  (K.34)  as 
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(K.35) 


V  j  A ,  W  j  A, , 

,  8a  ,  £a 

1  + -  1 - 

2a7  2a n 

Now, 

<5ior  _  2  a,  an  -a,  _  a,  +  au 


la, 

2a, 

2a, 

2a, 

8a 

d5 

<N 

1 

& 

i 

d5 

i 

a,  +  a „ 

2  au 

2  a„ 

d5 

<N 

<N 

1 

leading  to  the  correct  answer 


dA, 


a7  —  a n 


dz 


dAu 

dz 


(K.36) 


However,  note  that  we  had  to  restrict  ourselves  to  1st  order  in  8a .  Let  us  consider 
another  example,  Tappert’s  change  of  variable  substitution,  and  show  that  the  same  sort 
of  thing  happens. 


K.2.1.2  The  change  of  variable  substitution  A  ->  u/Jp 


In  this  subsection,  we  will  apply  the  change  of  variable  substitution  suggested  by  Tappert 
(see  pp.  262-264  of  reference  [15]),  and  show  that  to  1st  order,  8  -function  bifurcation 

will  be  needed  to  correctly  reproduce  the  continuity  condition  on  the  field  A  =  uj A Jp  . 
Once  again  as  in  Subsections  K.1.2  and  K.2.1.1,  let  us  consider  a  2-fluid  interface 
embedded  in  2-dimensional  x  —  z  space.  We  use  a  local  coordinate  system  so  that  the 
interface  is  at  z  =  0  and  the  z-axis  is  (locally)  normal  to  the  interface.  Thus,  the 
geometry  outlined  in  Figure  K.3  holds  here  as  well. 


Under  the  change  of  variable  substitution,  the  wave  equation  becomes 


0  =  pV 


p 


+  k2  -yfp  ‘ 


U  . 


(K.37) 


The  boundary  condition  on  u  itself  (as  opposed  to  its  normal  derivative)  will  come  from 
terms  involving  the  second  transverse  derivative  on  a  step  function  (which  in  this  case 
comes  from  the  density  p )  and  terms  involving  the  second  transverse  derivative  of  u  aaaa 
Thus,  equation  (K.37)  becomes 


aaaa  The  first  transverse  derivatives  of  the  step  function  and  of  U  will  be  involved  in  a  calculation  of  the 
boundary  condition  on  V Tu  much  like  that  presented  in  Subsection  K.  1.2  for  the  calculation  of  V rA  . 
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0  —  yfpV^-U  +  U -\ - 


(K.38) 


where  the  ellipsis  once  again  stands  for  terms  that  will  not  be  involved  in  determining  the 
boundary  condition  on  u  .  Now,  we  divide  by  yfp  (or  equivalently  multiply  by  1 /yfp  ). 

Once  again,  as  in  the  calculation  following  equation  (K.33),  this  is  the  crucial  and,  as  it 
turns  out,  partly  illegal  step.  Once  again  in  contrast  to  Subsection  K.1.2,  where  we 
multiplied  by  pn  -  Sp  ■  0(z) ,  here  we  multiply  by  the  step  function  0  embedded  in 

another  function  of  the  basic  form  1  /  Vi  +  constant  -0  . 


At  any  rate,  dividing  equation  (K.38)  by  yfp  gives  us  the  equation 

o = v2ru + — ?=r  v2Tjp 


Pl 


u  +  • 


(K.39) 


Note  that  z  is  in  the  normal  direction,  and  so  V^.  =  d2/dz2  stands  for  two  derivatives  in 
the  normal  direction.  Thus,  it  is  undone  by  the  infinitesimal  double  integration  in  the 

£  Z  £  Z 

nonnal  direction  }  dz  }  dz  •  •  • ,  and  so  }  dz  }  dz-V2ru  =  u,  -  un  . 

—£  — oo  —  £  — oo 


We  will  also  need  to  use  8  -function  bifurcation  to  evaluate 

^ I —  T'fP  • 


Since  p  =  pn  -  Sp  -®(z) ,  we  have 


nP  ■ 


dz 


dz 


P 


d_ 

dz 


1 


Spd_ 

4  dz 


S+  8 

r  + 


2  p 

A 


\[Pi  \l  Pii 


{-Sp) 

__8p_ 

4 


-  +  - 


V2  2y 

S,+  +  S' 


\[pi  V '  p  a 


and  multiplying  by  u/ ^[p  gives  us 


Sp 


Sp 


u  =  -^U(S/+  +  S')  =  - 

4p  v  ’  4 


ui  g'+  _|_  Uu 


P, 


Pn 


Proceeding  in  much  the  same  way  as  in  the  discussion  in  Subsection  K.  1 . 1 .2  (a  little 

below  equation  (K.18)),  we  note  that  under  the  relevant  infinitesimal  integration 
/ 

£  z 

}  dz'  }  dz--  - , 

—£  — oo 


253 


[s,+ /2](ur / p r)  +  [S'~  /2](un /  pu) 


integrates  to  (uI/p1)  +  (un / pu) ,  which  is  the  same  as 

£  z 

}  dz  }  dz[{uIlpI)  +  {uIIlpII)\-5\z). 

—£  — oo 


and  so  we  now  are  free  to  pull  out  the  bifurcated  S'  -function  and  replace  it  with  an 
ordinary  S'  -functionbbbb: 

ui  g'+  _|_  un  g'  _  _  §P_  _|_  un  g' 

KPi  Pu  J  4  l Pi  Pu) 


hi^y-T 


■jp 


(K.40) 


Substituting  (K.40)  into  (K.39)  leaves  us  with 


0  =  Viu  -  — 


f 


ui  _|_  uii  i^H _ 


V  Pi  P ii  J 


£  z 

and  integrating  twice  J  dz'  \  dz---  gives  us 

—£  — oo 


Sp_ 

4 


^  Ur  Ur,  ^ 

V  Pi  P ii  J 


[l- Sp 

=  un 

1 

+ 

■s* 
_ 1 

^Pl_ 

L  4Pii] 

(K.41) 


Once  again  we  need  to  restrict  ourselves  to  1st  order  ( O  (Sp/ p)  ).  Now  equation  (K.41) 
becomes: 


u  j 


i 

+ 

* 
_ i 

-Vi 

=  un 

[i-  *  1 

L  2  Pi. 

2Pu  _ 

(K.42) 


Noting  that 

t  ,  Sp  _2 pl+pIl-pl  _  p 

2  Pi  2p,  Pj 

t  SP  _2P„-Pu+P,  _  P 
2Pu  2Pn  Pu 


bbbb  j|lc  details  have  been  repeated  here  in  order  to  help  us  familiarize  ourselves  with  this  useful  aspect  of 
the  S  -function  bifurcation  formalism. 
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equation  (K.42)  implies 


ui  V Pi  Uu  V  P  n  • 


Furthermore, 


Uu 


and  we  have  recovered  the  continuity  condition 


4  =  4/  • 


(K.43) 


Thus,  we  have  again  shown  that  8  -function  bifurcation  works  to  O  (Spj  p) . 


The  reason  that  we  needed  to  restrict  ourselves  to  1st  order  ( O  (Sp/ p)  )  in  the  example 

just  given  here  in  K.2. 1 .2,  as  well  as  in  the  example  given  earlier  in  Subsection  K.2. 1.1, 
will  be  examined  in  Subsection  K.2.2  below. 

Finally,  note  that  in  addition  to  casting  light  on  the  applicability  of  8  -function 
bifurcation,  the  change  of  variable  procedure  raises  a  number  of  other  interesting  issues, 
and  so  we  also  further  examine  this  substitution  in  Sections  2.2  and  6.2.1. 


K.2.2  The  need  for  clean  (undistorted)  steps 

When  applied  “naively,”  the  8  -function  bifurcation  procedure  appears  to  work  only  to  1st 
order  in  some  cases  and  exactly  in  others.  It  turns  out  that  the  problem  does  not  lie  with 
the  8  -function  bifurcation  prescription  per  se.  It  is  perfectly  sensible  that  a  8  -function 
will  pick  up  the  average  between  the  values  of  a  discontinuous  variable,  provided  that 
once  we  introduce  a  step  into  a  given  parameter  (and  subsequently  take  derivatives  of  it 
to  produce  a  8  -function),  we  do  not  expect  the  exact  same  (undistorted)  step  to  also 
govern  the  behavior  of  the  jump  associated  with  another  parameter  that  depends  on  the 
original  parameter  in  some  non- trivial  manner.  The  reason  is  this:  According  to 
generalized  function  (i.e.,  distribution)  theory,  the  Heaviside  step  function  0(z)  is  really 

a  sequence  of  functions  symmetric  about  the  origin  ( z  =  0 )  that  converge  to  the  step. 

Once  we  operate  on  the  functions  in  this  sequence  (e.g.,  take  the  reciprocal,  the  square 
root,  etc.),  this  distorts  the  symmetry,  and  the  functions  will  be  weighted  differently  when 
they  multiply  the  corresponding  8  -function.  In  other  words,  the  8  -function  no  longer 
bifurcates  evenly  8  — >  8+/2  +  8~ / 2 ,  but  now  we  have  the  more  general  condition: 

8  ->a8+  +  {l-a)8-  0  <  a  <  1 .  (K.44) 
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To  further  examine  these  issues,  we  begin  by  once  again  considering  the  operator  that 
appears  in  the  variable  density  acoustic  equation  (K.28):  Vr  (1  / p)  Vr  .  Recall  that  our 
prescription  requires  that  the  chain  rule  (for  differentiation)  be  applied  first,  and  the  step 
is  only  subsequently  put  directly  into  the  density  ( p  =  pu  -  Sp  ■  0(z) ).  It  is  important  to 

note  that  our  concerns  about  distorting  the  step  function  do  not  extend  to  the  - 1  / p1  that 
is  generated  when  we  have  used  the  product  and  chain  rules  to  expand  the  operator: 

Vr  (l/  p)  Vr  =  vr  +  -  V; .  (K.45) 

P~  P 


One  the  other  hand,  as  an  alternative  we  could  have  proceeded  roughly  as  in  Subsection 
K.2. 1.1,  and  first  put  the  step  directly  into  1  J p  .  To  clarify  certain  issues  relevant  to  the 

current  discussion,  let  us  recast  the  equation  a  =  an+Sa&(z)  to 


_l_ 

P 


Pu 


0(z) 


Pu 


Pi 


(K.46) 


Substituting  equation  (K.46)  directly  into  equation  Vr  {\j  p)  V,  ,  we  generate  a  8  - 
function  by  operating  on  the  step  function  0(z)  with  VT .  Now  it  will  be  the  step  in  \j p 
that  is  symmetric  about  z  =  0  and  rather  than  the  step  in  p  .  By  itself,  this  is  fine. 
However,  in  the  other  term  in  the  expansion  of  Vr  ( 1/ p)  Vr  ,  ( 1//?)  Vj.  ,  we  are  left  with  a 

factor  of  1/ p  in  front  of  the  leading  operator  V2T .  To  integrate  with  respect  to  z  (in  our 
usual  1 -dimensional  example)  and  obtain  the  boundary  conditions,  we  will  need  to  get  rid 
of  that  1/ p  in  front  of  V2r  by  multiplying  through  by  a  p  .  But  now  we  are  multiplying  a 
8  -function  by  an  asymmetric  step,  and  we  would  have  to  adjust  the  weighting  factor 
accordingly.  It  is  not  obvious  how  to  do  that.  Now  we  can  understand  why  we  are  better 
off  putting  our  step  directly  into  p  (as  originally  suggested  in  the  previous  paragraph). 
Somehow,  the  chain  rule  automatically  provides  extra  coefficients  for  the  two  halves  of 
the  bifurcated  8  -function  generated  by  V7  {\f  p)  so  that  it  properly  corresponds  to  an 
undistorted  step  for  p  itself  (that  the  chain  rule  is  really  capable  of  doing  so  is  not  self- 
evident — see,  for  example  footnote  zz  and  the  related  discussion  in  Section  5.2 — but  it 
does  seem  to  do  the  job!).  Thus,  we  are  free  to  multiply  through  by  a  p  and  apply 
simple  8 -function  bifurcation  (i.e.,  a  =  1  -a  =  l/2  in  (K.44)).  It  is  the  coefficients 
generated  by  the  chain  rule  that  provide  the  asymmetrical  weighting. 

Let  us  now  summarize  these  insights  in  way  that  ties  them  to  the  parts  of  Appendix  K 
that  have  come  previous  to  this  point.  We  see  that  the  trouble  actually  begins  after  the 
8  -bifurcation  procedure  has  been  completed,  and  we  are  left  with  a  discontinuous 
function  in  front  of  the  leading  order  derivative  (i.e.,  the  one  that  will  be  integrated  one, 
two  and  in  general  three,  four,  or  more  times).  For  the  constant  density  parabolic 
equation  ( Sp  =  0  ),  there  is  no  such  function,  and  the  issue  never  comes  up.  On  the  other 
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hand,  as  noted  in  the  previous  paragraph,  for  the  V7.  (1/ p)  VT  operator,  we  get  (1/ p)  V2  : 
i.e.,  1  /p  times  the  leading  order  derivative.  Now  we  multiply  by  p  (i.e.,  a  simple  step 
function)  so  that  the  1  J  p  in  front  of  V2  cancels.  Elsewhere,  this  leads  us  to  multiply  our 
8  -functions  by  a  simple  step.  The  step  is  effectively  averaged,  and  our  prescription 
remains  fine.  However,  in  the  two  examples  that  fail  at  2nd  order  (see  section  K.2.1),  the 
leading  order  derivative  V2  was  multiplied  by  a ,  a  function  with  a  step,  or  yfp  ,  the 
square  root  of  a  function  with  a  step.  If  we  now  multiply  through  by  the  reciprocal 
(formally  1  /a  and  1 /  yfp  in  our  examples),  we  run  into  trouble.  Since  we  have  distorted 
the  associated  series  of  functions  that  (in  the  generalized  sense)  approach  the  step 
function,  we  cannot  expect  the  relative  weights  in  the  8  -function  bifurcation  to  provide  a 
simple  averaging,  and  indeed  they  do  not  (i.e.,  a  ^  l/2).  We  obtain  the  wrong  answer  if 
we  assume  that  to  be  the  case.  On  the  other  hand,  if  we  content  ourselves  with  a  1st  order 
result,  then  multiplication  by  the  reciprocal  is  converted  into  multiplication  by  a  clean 
step,  and  we  do  indeed  obtain  the  correct  answer  to  1st  order  in  both  our  examples. 

Note  that  this  reasoning  apparently  implies  that  we  can  in  general  use  Taylor  series 
expansions  to  guarantee  that  we  are  always  multiplying  by  powers  of  undistorted  steps, 
and  so  place  any  problem  into  a  form  where  the  8  -function-bifurcation  prescription 
applies.  In  Section  K.2.3  just  below,  we  will  provide  an  example  of  this  principle  at 
work. 

Before  proceeding  to  this  example,  let  us  consider  one  more  issue:  what  happens  if  we 
mix  two  independent  parameters,  say  the  density  p  and  the  compressibility  K . 
Everything  should  work  fine  without  further  consideration  provided  the  parameters  are 
truly  independent.  However,  what  if  we  use  the  density  p ,  but  now  replace  the 

compressibility  K  with  the  sound  speed  c  =  l/ y] pK  ?  This  raises  the  question:  does  a 
step  in  p  go  with  a  step  in  K  or  a  step  in  cl  The  answer  depends  on  the  context.  Either 
K  or  c  may  in  principle  be  considered  independent  of  p  .  The  context,  however,  will 
determine  which  variable  we  choose  to  consider  independent  of  p  ,  and  that  is  the  one 
with  which  we  associate  a  clean  (undistorted)  step.  In  the  full  wave  equation,  the  only 
8  -functions  involve  p ,  and  these  8  -functions  never  multiply  a  sound  speed  or  a 
compressibility,  so  the  point  is  mute.  However,  the  issue  arises  in  the  parabolic  equation. 
In  this  case,  the  chain  rule  will  keep  things  consistent,  but  one  choice  will  be  more 
convenient.  If  we  choose  the  compressibility  K  as  the  variable  that  is  independent  of  the 
density  p  ,  then  the  key  PE  parameter  p  depends  (linearly)  on  K  alone,  and  we  can  step 
K  (or  absolutely  equivalently  p  )  and  p  in  tandem,  and  make  no  further  adjustments. 

On  the  other  hand,  if  we  choose  p  and  c  as  our  independent  variables,  now  p  picks  up 
a  p  -dependence  (as  well  as  a  1  / c2  -dependence),  and  we  will  need  to  use  the  chain  rule 
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in  evaluating  p  when  it  is  operated  on  by  a  derivativecccc.  Since  things  are  much  simpler 
when  we  use  the  compressibility  K  (or  completely  equivalently  //  ),  we  will  take  this  to 
be  the  physical  parameter  that  is  independent  of  the  density  p  . 


K.2.3  Example:  Extending  the  treatment  of  a  =  \/p  to  2nd  order  using 
Taylor  series  expansion 

To  further  verify  the  hypothesis  that  the  8  -function-bifurcation  procedure  applies  as  long 
as  the  8  -function  is  multiplied  by  clean  undistorted  steps,  let  us  return  to  the  example  in 
Subsection  K.2.1.1,  and  expand  to  1st  order  in  8a  the  factor  of  1 /a  multiplying  the 
bifurcating  8  -function.  Previously,  we  used  0th  order,  so  this  will  overall  generate  an 
extra  order  in  8a  beyond  that  previously  used.  Having  expanded  in  a  Taylor  series,  we 
are  now  simply  multiplying  through  by  a  legitimate  (undistorted)  step  function.  Since  the 
step  function  multiplies  a  8  -function  that  is  already  1 st  order  in  8a ,  this  maneuver  will 
gain  us  one  more  order  of  accuracy  overall  than  the  order  of  the  expansion  of  1  Ja — an 

overall  accuracy  to  O  [8a2 )  in  this  case. 


Note  that  the  geometry  is  again  given  by  Figure  K.4.  We  consider  2-dimensional  x-z 
space  and  we  have  a  =  au  -  8a-  0(z) ,  where  8a  =  au  -  a{ .  We  are  evaluating  the 
equation 


0  -  Vr  A] )  +  •  •  • 


(where  the  ellipsis  stands  for  terms  that  will  not  a  play  a  role  in  detennining  the  boundary 
conditions  on  the  field  A  ).  Applying  the  chain  rule  in  this  context  gives  us 


0  =  Vr  ([dVTA])  +  ---  =  (ff2TA  +  (V  Ta)W  TA  +  •  ■  ■ 


■■  aV2TA  -  8a 


—vta++—vta 

v2  2  , 


+  • 


(K.47) 


cccc  Even  cross-terms  between  Vr  ( 1  //?)  Vr  and  1/ p  should  pose  no  problems  provided  we  use  the 
chain  rule  for  differentiation  to  generate  our  first  8  -function.  In  fact,  the  same  is  also  true  for  all  the  cross¬ 
terms  associated  with  higher  orders  of  the  operator  Vr  ( 1  //?)  Vr  .  To  see  that  this  must  be  true  for  the 

broad  class  of  cases  mentioned  here,  just  let  everything  to  the  right  of  the  leftmost  Vy  (4/ p)VT  operator 
take  the  place  of  A  in  the  calculation  of  Section  K.1.2. 
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Now  we  multiply  through  by  a  1  /a .  The  a  in  the  first  term  cancels.  Now,  as 
advertised,  let  us  take  the  factor  of  \/a  multiplying  the  bifurcating  8  -function  and 
expand  it  to  first  order: 


1  -  1  1  1 

1 

1  S 

fi-*W 

l  aa  ) 

a  a 

8a 


1  + — ®(z)  +  o(8a2). 


a, 


(K.48) 


Substituting  (K.48)  into  (K.47)  multiplied  by  1  /a,  we  obtain 


o  =  v2ta- 


■8a 


8^J_ 

2  a„ 


f .  8a 
1  +  — 


V 


a, 


■n  j 


e-  i 

V rA,  h - V TAn 

2  a 

Z  un  J 


- hO(&t3) .  (K.49) 


Recall  that  for  our  2-dimensional  example,  Vr  =  d/dz  and  integrate  J  dz  to  get 


0  =  VtAj  -  VTAIf 


8a 


2  a 


f  8a ' 


1  + 


7/  V  ^11  J 


8a 


VrA,  ~—VrAn  +  O (8a2 ) 
2  a 


and  consequently 


vr  4 


8a 

(8af  '' 

=  VtAi 

2a u 

2al  j 

8a 

(8a)2  ^ 

(8a)~ 

2a u 

4  a2,  j 

4  al 

f  8a  ' 
1  +  - 


V  ^an  J 


+ 


o(8a3) 


V  A  =V  A 

2  T^-I  v  T^-II 


f .  8a  A 

1  + 


v  2 an  j 


+ 


0(&r3) 


Now,  Vj-Aj  =  VTAU  +  0(8a)  and  so  to  0(&t3)  we  can  replace  VrAj  with  VrAu  in  the 

term  just  to  the  left  of  the  =  sign  and  move  it  to  the  right  side  of  the  equation: 

(  \ 


8/  tAi 


1_  8a  _  (8a)2 
2  an  4  a], 
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+  0(e>Gf3).  (K.50) 


Now,  note  that 
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Thus  to  O  ( Sex  ' )  we  can  substitute  a,  for  an  on  the  left  side  of  equation  (K.50): 
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Now  use 
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to  obtain 
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(K.51) 


This  is  exactly  equation  (K.35)  in  Subsection  K.2.1.1.  As  noted  there,  this  quickly  leads 
to 

V  tA,  V  tA,, 


2a,  +  a„  -  a, 

2a „  -  a„  +  aj 
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<N 

_ 1 
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o(&t3) 


U 

a,VrA,  =  anS7TAn  +  O  (Sa' ) 


(K.52) 


as  desired. 

By  expanding  the  coefficient  of  the  bifurcating  S  -function  in  a  Taylor  series,  we  have 
converted  from  multiplication  by  distorted  steps  to  multiplication  by  undistorted  steps, 
and  so  we  have  been  able  to  increase  the  order  to  which  the  final  result  is  valid.  Since  the 
bifurcating  S  -function  already  carries  one  order,  the  order  of  validity  turns  out  to  be  one 
greater  than  the  order  of  the  Taylor  series  employed. 


K.3  Verification  of  the  associative  property  for  3 -function 
bifurcation 

The  associative  property  is  built  into  basic  distribution  theory.  This  is  the  main  reason 
behind  those  expansions  connecting  S{ll)  (z  -  f)z(z )  with  "*  (z~  f)  X(f)  (see 
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footnote  11  concerning  equation  (3.54)).  8  -function  bifurcation  is  a  linear  operation,  and 
it  does  not  affect  these  equalities. 

However,  the  8  -function  bifurcation  prescription  begs  the  following  question:  does  the 
order  in  which  we  take  nested  derivatives  and  consequently  “kick  up”  8  -functions 
matter?  Interestingly,  the  answer  seems  to  be  that  the  order  does  not  matter.  Let  us 
consider  several  examples. 

In  the  first  example  to  be  considered  here  in  Section  K.3,  note  that  we  evaluated  equation 
(K.  15)  by  going  from  right  to  left,  taking  derivatives  and  converting  to  bifurcated  8  - 
functions  as  they  are  generated.  This  is  sensible,  but  if  the  associative  property  for 
multiplication  were  to  hold,  we  should  also  be  able  to  apply  the  chain  rule  going  from  left 
to  right,  and  get  the  same  answer.  This  is  indeed  the  case. 


To  see  why  things  will  always  work  out,  first  consider 

d_ 

dz 


0(z)- 


We  could  either  begin  with 


30 

dz 


8{z )-> 


8+(z)  |  8(z) 
2  2 


and  so  (recalling  0+  =  0(z  >  0)  =  1  and  0  =  0(z  <  0)  =  0 ) 
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and  thus 
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(K.53) 


Alternately,  we  could  proceed  as  follows: 
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30  30 
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8'+  (z)  3 [l]  8'  (z)3[0]  S'+  (z)  S'+  (z) 
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(K.54) 


The  “extra”  terms  go  away,  and  we  see  that  we  can  evaluate  derivatives  (and  replace 
them  with  bifurcating  8  -functions)  at  any  time  during  the  calculation. 


Similarly,  note  that 
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3 2  r  ,  x  ,  x-|  3  30  ,  x  x  x30~]  3  |~^+(z)  ^  (z) 

^[0(z)0(z)]  =  -  —  0(z)  +  @(z)  —  ^  +  ^ 


3z  3z 

=<n-) > 


or  alternately 

32  r  /  /  x-i  320(z)  x  x320(z)  3030 
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wwh 3  3 

3z  3z 

=  ££)  +  ££) 

2  2  v  7 

Again,  the  “extra”  tenns  disappear  and  we  can  proceed  in  either  order. 

Now,  consider  A4  (  8p  =  0  )  theory.  Here,  we  would  get  a  tenn  roughly  like 


—  [0(z)0(z)0(z)] 


=  —  [£(z)0(z)0(z)  +  0(z)£(z)0(z)  +  0(z)0(z)£(z)] . 
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3z  2  2  2  2 


Alternately,  we  could  proceed  with 
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Again,  all  the  “extra”  terms  contain  a  product  of  the  form 

30  30  _  S+  3  [l]  _ 

3  z  3 z  2  3z 
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and  so  they  go  away.  The  same  goes  for  a  term  like  V2T©VT©  (which  is  similar  to  one 
associated  with  a  term  like  V2T  y/LN2r  ). 

We  can  start  to  see  why  things  are  working  out.  Cross-terms  of  the  basic  fonn 

30  30 
3  z  3 z 


always  go  away.  Thus,  every  time  we  have  a  product  of  step  functions,  and  start  taking 
derivatives,  the  chain  rule  for  multiplication  will  lead  to  a  sum  of  terms  each  with  one 
step  function  differentiated  and  the  others  untouched.  Henceforth,  all  further  derivatives 
in  a  given  product  of  step  functions  will  focus  only  on  one  step  function.  In  other  words, 
the  first  S  -function  caused  the  rest  of  them  to  collapse.  We  can  either  apply  S  -function 
bifurcation  right  away  and  so  avoid  generating  the  superfluous  terms;  or  we  can  wait  and 
drop  the  superfluous  terms  later.  We  will  get  the  same  answer  either  way. 


Note  that  these  arguments  also  apply  to  the  theory  for  variable  density  ( dp  ^  0  ).  For 
example,  consider  the  “sandwich  term”  in  A2  -theory: 


1 


-^t(px) 

\P 


Whether  we  apply  to  the  chain  rule  to  the  “inner”  Vy.  operator  first  and  bifurcate 
immediately  as  S  -functions  are  generated,  or  apply  the  chain  rule  for  multiplication  from 
left  to  right  and  again  bifurcate  as  S  -functions  are  generated,  or  apply  the  chain  rule  for 
multiplication  from  left  to  right  and  bifurcate  at  the  end,  we  will  get  the  same  answer  in 
all  cases. 


Now,  note  also  that  as  we  introduce  downrange  derivatives  3/3x ,  the  associative 


property  continues  to  apply.  For  example,  let  us  consider  the  important  example 


—  _ 

VT£-VT,y 

L  p  \ 

(K.55) 


(This  expression  shows  up  as  we  derive  the  manifestly  reciprocal  form  for  the 
Hamiltonian  where  there  is  a  density  jump,  equation  (5.9);  see  also  Appendix  J.2  for  the 
details.)  Of  course,  if  we  take  dA/dx  first,  bifurcation  puts  us  in  one  medium  or  the 


other,  and  y  is  a  constant  in  the  half-spaces  and  commutes  with  everything,  and  so  the 
commutator  in  equation  (K.55)  is  obviously  zero.  Now,  let  us  evaluate  the  expression 
another  way: 


£vrte) 

-yvT 

^ rX 

IP 

IP  J 

Expanding  VT ,  we  have 
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-Vr(K)  + 
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Now, 

v T{n)  =  (yTr)z+r(yTz ) 

and 

K()z  )  =  vr  [(vTr)z+r(vT, r)] 

=  (v1Tr)z+2(vTr)(vTz)+ r^lx)' 


Thus,  we  have 


Thus,  we  only  have  products  of  derivatives  of  steps  left.  Bifurcating  VTy  first,  p  is 
constant  in  the  half-space  and  p-  0  ;  bifurcating  p  first,  VTy=0.  Either  way, 

[i>r\  =  o. 

For  clarity,  we  will  always  take  d/dx  first  and  generate  our  bifurcating  8  -functions  right 
away,  but  we  could  in  principle  proceed  in  any  order. 

Thus,  here  in  Appendix  K.3,  we  have  seen  that  at  least  for  the  kind  of  simple  scenarios 
that  arise  in  the  context  of  the  parabolic  equation  when  the  parameters  jump  (but  do  not 
vary  otherwise),  the  8  -function  bifurcating  formalism  obeys  an  associative  property.  To 
be  specific,  even  with  our  modifications  to  distribution  theory,  it  does  not  matter  which 
order  we  take  nested  derivatives  and  in  the  process  “kick  up”  bifurcating  8  -functions. 

Finally,  note  that  in  Appendix  J.2  (specifically,  the  argument  leading  to  equation  (J.  19)), 
we  came  across  an  example  where  the  associative  property  did  not  hold.  This  is 
connected  to  the  fact  that  there,  we  not  only  had  an  interface  where  the  parameters  jump, 
but  we  also  allowed  for  further  variation  of  the  parameter  p  in  the  vicinity  of  the 
interface.  Thus,  it  should  be  emphasized  that  it  is  implicitly  assumed  in  the  8  -function- 
bifurcation  rules  that  the  parameters  jump  at  an  interface,  but  do  not  otherwise  vary  in  the 
vicinity  of  the  interface.  If  this  condition  is  violated,  the  associative  property  in  choosing 
which  distributions  to  bifurcate  may  not  hold,  and  explicit  symmetrization  may  be 
necessary. 
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L  Appendix:  The  Hamiltonian  in  the  half-space  where 
the  density  is  locally  constant,  but  differs  from  the 
reference  density 

In  this  appendix,  we  derive  the  Hamiltonian  in  the  half-space  where  the  density  is  locally 
constant.  We  obtain  two  closed-form  square-root  operators  that  generate  the  same 
infinite-order  Hamiltonian.  The  square-root  operator  derived  in  Appendix  L.l  and 
presented  at  the  end  of  equation  (L.8)  follows  from  an  intuitive  derivation  and  provides 
an  easily  remembered  closed  form  operator  that  generates  the  acoustic  Foldy- 
Wouthuysen  expansion  in  the  half-space  where  the  density  is  locally  constant  (but 
different  from  the  reference  density).  However,  this  closed  form  suggests  a  grouping  of 
terms  that  provides  overly  pessimistic  convergence  criteria  for  the  expansion.  Appendix 
L.2  explores  the  correct  grouping  of  terms  in  the  expansion,  and  demonstrates  that  broad 
convergence  criteria  apply.  In  particular,  the  approach  in  Appendix  L.2  does  not  mix 
orders  in  the  correct  expansion  parameters,  while  the  expansion  in  Appendix  L.l  does. 
However,  the  closed  form  obtained  using  the  approach  of  Appendix  L.2  (equation  (L.14)) 
is  comparatively  difficult  to  work  with,  so  it  is  preferable  in  practice  to  generate  the 
finite-order  half-space  Hamiltonian  using  result  (L.8). 


L.l  Derivation  of  the  form  useful  as  a  mnemonic 

Recall  from  equation  (5.3)  that 


H  = 

O  +  £  +  kji 

£  = 

(1- 

2  kj 

)rj, 

(L.l) 

O  = 

% 

where  X  and  y  are  again  given  by  equation  (5.4),  while  the  box  again  in  this  appendix 
indicates  a  tenn  specific  to  the  case  when  the  reference  density  p0  differs  from  the  local 
density  p  .  77  and  £  are  once  again  given  by  equations  (3.3)  and  (3.6)  respectively.  For 
present  purposes,  the  properties 
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will  turn  out  to  be  useful. 


From  equations  (3.9)  and  (3.10)  with  the  (local)  range  dependence  set  to  zero,  we  have  to 


4th  order 
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f  O2  _o4^ 
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Substituting  for  O  and  S  gives 
H  = 


(L.4) 


Expanding  the  last  term: 
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using  identities  (L.2)  and  [£,[£,7]]  =  -4 7  leads  to 
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o7  3  (- [*£> tol\  W,2kjn\  -  [A^2k()m\ [A$,  Arj\  +  [Af, 2k0yj]f ) 

OKr, 


Next,  again  use  [£,[£,7]]  =  -47  and  the  fact  that  y  is  locally  constant  in  the  half-space 
to  show 
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[^[^7]]  =  7^(2  kj). 


(L-6) 


Now,  we  consider  the  terms  in  the  third  line  of  equation  (L.5).  Holding  y  locally 
constant,  we  have 

tei\ [*4, 2hVl\  =  W^Km\ [A4,  At]}  =  2 kjA3  [^,7]2  =  4  {2k j)  A3  •  1 


and 


[A£,2k0yn]2  =  4klfA2  [£7  f  =  4  (2  kj)2  A2 -l, 


so  that 
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(-  [*4,  kn\  [*4,  2K  m\  -  [*4,  2K  m\ [14,  kn\ + [*4,  2K  vfi )  = 
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Thus,  we  have  to  4th  order  (recalling  a  =  p0/  p) 
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This  is  the  result  we  are  looking  for. 


Finally  we  show  that  the  final  form  of  H  obtained  in  equation  (L.8)  is  indeed  equivalent 
to  the  known  “correct”  (i.e.,  °°  -order)  answer  for  the  Hamiltonian  in  a  half-space  where 
the  density  is  locally  constant.  We  begin  with 
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and  use  equation  (5.4)  with  the  density  locally  constant  to  obtain 
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^a=v2t 
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_k+*=„=+k_  4. 

a  a  or 
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Note  the  cancellation  leaving  us  with 


H  =  n  \  ^  =  Tjyjvl  +  k20n2  =  7)yJv2T  +  k2 


cc 


a 


(L.9) 


This  is  indeed  the  standard  -order  PE  Hamiltonian  given,  for  example  in  equation 
(9.52)  of  reference  [57].  It  is  completely  equivalent  to  equation  (3.11)  above  with  the 
(local)  range  dependence  set  to  zero  (the  square-root  operator  there  also  reduces  to 

+  k2  ).  When  there  is  no  (local)  range  dependence,  we  can  also  use  the  PE 

Hamiltonian  to  propagate  the  pressure  field  A  ,  and  equation  (L.9)  then  also  recovers 
equation  (2.4)  above.  We  have  therefore  shown  that  in  the  half-space  where  the  density 
is  locally  constant,  the  Foldy-Wouthuysen  procedure  generates  a  PE  expansion  of  the  PE 
square  root  operator,  where  the  expansion  parameters  now  measure  the  departure  of  the 
local  values  of  both  the  sound  speed  and  density  from  their  reference  values. 


L.2  Heuristic  derivation  of  a  form  that  accurately  reflects  the 
convergence  properties  of  the  expansion 

Appendix  L.2  considers  the  convergence  properties  of  the  series  for  the  Hamiltonian  in 
the  half-space.  In  Subsection  L.2.1,  we  demonstrate  that  the  convergence  criteria 
obtained  by  naively  expanding  the  closed  fonn  of  the  scalar  Hamiltonian  obtained  in 
Appendix  L.l  are  quite  strict.  In  Subsection  L.2. 2  we  use  general  arguments  to  obtain  an 
alternate  closed  form  for  the  Hamiltonian  in  the  half-space  that  suggests  that  these 
convergence  criteria  are  too  strict.  Subsection  L.2. 3,  we  demonstrate  that  the  looser 
convergence  conditions  given  in  Section  L.2. 2  are  indeed  the  correct  ones.  We  begin  by 
grouping  terms  in  such  a  way  that  we  preserve  the  intra-order  cancellations  that  are 
present  in  the  Foldy-Wouthuysen  expansion  and  are  needed  to  generate  its  true 
convergence  properties.  The  resultant  closed  form  manifestly  incorporates  these 
cancellations  and  thus  reflects  the  true  convergence  properties  of  the  expansion. 


L.2.1  Why  there  is  an  issue  here? 

The  result  for  the  scalar  Hamiltonian  H : 

h  =  K  l  +  2ort 
cc  ]j  k0 

derived  in  Appendix  L.l  must  be  interpreted  as  an  expansion  in  the  dimensionless 
operator  2/l/k0  and  the  dimensionless  scalar  function  2  y  (which  comes  from 

a  =  l/(l  -  2y) ).  Used  properly,  this  expansion  is  taken  order  by  order  with  these 
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expansion  parameters  (and  more  generally  also  the  downrange  derivative  d/dx  acting  on 
A)  contributing  equally  to  the  order.  In  this  sense  equation  (L.10)  is  a  mnemonic  that 
encapsulates  this  expansion.  We  have  to  be  careful  about  taking  this  mnemonic  too 
literally.  Specifically,  we  cannot  treat  the  operator  2aAjk0  as  an  expansion  parameter. 


To  see  why,  note  that  if  the  density  p  and  the  compressibility  K  are  range  independent, 
then  we  can  decompose  the  solution  into  eigenvectors.  For  each  eigenvector,  we  can 
replace  the  operator  H  with  its  eigenvalue  kx  (the  downrange  wave  number  of  the 
eigenvector  solution)  so  that 


^o_ 

a 


2a A 


and  so  noting  that  k]  =  c op0K0 ,  k2  =  copK  and  as  always  a  -  p0/ p ,  we  have 

t  I  2  aA  _  k;  ^  _  k;  p0  K 
K  K  k2  p  K0 


Since  kx  is  the  x  -component  of  the  wavevector  for  the  waveguide  eigenvector  solution, 
and  it  is  real  since  the  finite-order  Hamiltonian  is  Hermitian,  we  can  write  k  jk  =  cos#, 
where  0  is  the  grazing  angle  corresponding  to  that  particular  solution.  This  leads  to 


2  aA  2  n  Po  K 

- =  cos  6  ■  — - 1 

K  P  K0 


(L.  11) 


To  maximize  convergence,  we  typically  choose  the  reference  compressibility  and 
density  K0  and  p0  to  be  the  maximum  values  of  these  parameters  found  in  the  problem. 

A  typical  problem  would  involve  an  interface  between  water  (labeled  by  the  Roman 
numeral  I)  and  a  muddy  ocean  bottom  (labeled  by  II).  Under  this  scenario,  the  higher 
compressibility  is  the  value  in  water  (so  K0  =  K j ),  and  the  higher  density  is  found  in  the 

muddy  bottom  (so  p0  =  pn  ).  Thus  for  eigenvalue  solutions,  we  have 


2a A 

K 

2a A 

K 


cos2  0  ■  — - 1  in  medium  I 

Pi 

cos2  9  — —  - 1  in  medium  II 
K, 


(L.12) 


Now,  we  have  an  interesting  problem:  for  the  relatively  common  scenario  pu  >2/7,  and 
shallow  grazing  angle  ( 0  small;  cos(#)  ~  1 ),  our  apparent  expansion  parameter  2aA/k0 
becomes  greater  than  one.  One  is  tempted  to  conclude  that  the  validity  of  our  PE  is 
limited  to  the  region  pn  <2p; .  However,  as  discussed  in  Section  2.3,  the  physics  of  the 
problem  imposes  no  such  limit.  We  are  forced  to  conclude  that  the  apparent  divergence 
of  the  expansion  is  an  artifact  of  the  way  the  terms  have  been  lumped  together  into  the 
nominal  expansion  parameter  2 aA/k0  .  In  the  Subsections  L.2.2  and  L.2.3  we  provide 
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confirmation  of  this  insight  by  explicitly  constructing  an  alternate  grouping  of  terms  that 
is  specifically  designed  to  accurately  reflect  the  true  convergence  behavior  of  the 
underlying  PE  expansion.  We  will  find  that  convergence  to  the  correct  result 

h  =  J¥t  +  k 2  is  manifest  for  all  A pj p0  <  1 . 


L.2.2  The  basic  strategy 


In  the  upcoming  Subsection  L.2.3  that  immediately  follows  the  current  subsection,  a  new 
grouping  of  terms  will  be  constructed.  This  grouping,  by  design,  will  reflect  the  true 
convergence  properties  of  the  PE  expansion  generated  by  the  Foldy-Wouthuysen 
transformation.  The  procedure  will  begin  with  the  observation  that  the  actual  order-by¬ 
order  (in  2A/k0,2y)  expansion  generated  by  the  Foldy-Wouthuysen  transfonnation 

contains  cancellations  that  are  associated  with  cross-terms  that  come  from  taking  powers 
of  A  -  y .  (On  the  other  hand,  if  we  let  the  derivation  in  Appendix  L.  1  influence  us  to 

consider  2aA/k(]  as  an  expansion  parameter,  then  we  would  be  combining  terms  across 
orders  in  the  original  expansion  (i.e.,  in  the  expansion  in  the  parameters  2A/k0  and  2 y) 
in  a  way  that  does  not  take  advantage  of  these  intra-order  cancellations.  Crucial 
cancellations  would  occur  between  different  orders  in  the  2aAlkl)  -expansion  rather  than 
within  a  given  order.)  The  intra-order  cancellations  will  be  used  to  guide  us  in  the 
construction  of  an  alternate  “mnemonic”  (i.e.,  a  compact  expression  that  can  be  used  to 
generate  the  expansion).  Since  the  cancellations  are  built  in,  this  alternate  form  should 
accurately  reflect  the  true  convergence  properties  of  the  expansion. 


As  it  turns  out,  we  could  almost  have  guessed  the  result.  It  reflects  the  fact  that  (locally 
at  least)  we  are  free  to  make  use  of  the  familiar  PE  operator  A  and  reference  wave 
number  k0 ,  where  the  reference  density  is  temporarily  chosen  to  be  the  local  density  (i.e., 
Po  =  P\  ocai  =  P  where  p0  is  the  reference  density  used  briefly  during  the  argument 

employed  to  construct  equation  (L.  14)  below),  and  the  difference  between  the  reference 
and  local  compressibilities  generates  the  familiar  deviation  of  the  index  of  refraction 
squared  (i.e.,  K0  =  K0).  Thus: 


K  =  c °4^P  = 


co 

cn 


jU=jU  =  - 
2 


i -2L 


K, 


o  J 


\_]Le2 

K„P 


V 


A  =  -A-  -  k0/i 
2kn 


1  -ZL 


-2\ 

0_ 

2 


<lj3> 


Within  the  half-space  (where  the  density  is  constant),  A 
locally  exact  PE  operator: 


and  k0  can  be  used  to  express  the 
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H  =  La  1  +  ^A  =  Jvi+k2 

1  K  v  r 


(L-14) 


As  noted  earlier,  the  latter  is  equivalent  to  the  mnemonic  given  in  equation  (L.10).  Using 
the  relations 


A 


2k- 


2co2K0p 


Po 


2cd  K0p0  p 


-P 


(  P^ 


\P  J 


2k- 


-M 


A 


~7 


K  =  a^[K^P  =  —  =  kjl~27 

V  Po 


(L-15) 


the  new  compact  expression  in  equation  (L.14)  can  be  expressed  and  then  expanded  in 
the  quantities  2A/k0  and  2 y.  Since  for  sufficiently  small  2A/k()  and  2/,  even  the 
expansion  derived  using  equation  (L.10)  clearly  converges,  the  expansions  of  (L.10)  and 
of  (L.14)  must  be  identical  at  any  given  order  in  2A/k0  and  2  y .  Thus,  we  have  a 
compact  expression  for  the  expansion  generated  by  the  Foldy-Wouthuysen 
transfonnation  that  implies  good  convergence  properties  as  long  as  2A/k0  and  2  y  are 

both  less  than  1  (i.e.,  for  all  densities  provided  we  choose  p0  =  /?max).  The  heuristic 

development  in  Subsection  L.2.3  roughly  proceeds  in  the  reverse  direction  of  the 
argument  in  this  paragraph.  It  is  useful  to  consider  it  in  detail  because  this  derivation 
clearly  illustrates  why  expression  (L.14)  more  accurately  reflects  the  true  expansion 
properties  of  full  PE  expansion. 


L.2.3  Detailed  heuristic  analysis 

Here  we  consider  the  PE  expansion  for  the  case  when  the  reference  density  differs  from 
the  local  density,  and  cast  it  in  a  fonn  where  the  natural  pairing  of  A  and  y  into  the 
combination  A  —  y  is  manifestly  apparent.  We  begin  by  grouping  the  1st  order  of  the 
expansion  in  the  following  suggestive  manner: 

K  ~ 2KY  +  A  — >  k0  +  (A- k0y )  - k0y  +■■■ . 


This  suggests  that  we  try  an  ansatz  with  Aeffective  =  A  -  k0y .  In  other  words,  we  anticipate 
a  result  of  the  form 


H  =  h. 


rh 


{  ,  2 (A-k0y)  | 
kn 


(L.16) 


Let  us  see  if  the  ansatz  works  to  2nd  order.  The  2nd  order  expansion  of  equation  (L.16)  is 
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and  so  we  have  reproduced  result  (L.  14). 

We  have  shown  that  the  PE  expansion  when  the  reference  density  differs  from  the  local 
density  is  generated  by: 

■  taking  the  generator  of  the  PE  that  is  commonly  used  when  the  density  is  tacitly 
assumed  to  be  everywhere  the  same  (for  the  moment  using  the  local  density  as  the 
reference  density), 

■  using  equation  (L.  17)  to  express  the  associated  reference  wave  number  k0 

( =  WyjPi ocai^o  )  anc*  the  related  operator  A  in  terms  of  our  usual  reference  wave 
number  k0  ( =  a>^ p0K0  )  as  well  as  in  the  expansion  parameters  associated  with 
the  variable  density  PE  (i.e.,  A,/), 

■  and  then  expanding  in  terms  of  these  new  expansion  parameters. 

This  result  applies  in  the  half-space  (or  more  generally,  any  finite-size  area)  where  the 
density  is  (at  least  locally)  constant,  and  only  the  reference  compressibility  changes  to 

vary  the  sound  speed.  Also  note  that  equation  (L.14)  (H  =  k0-Jl  +  2A/k0  )  leads  to  an 

expansion  in  the  nominal  expansion  parameter  2^/k0 ,  which  has  decent  convergence 
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properties  provided  that  the  reference  compressibility  K0  is  greater  than  the  local 
compressibility  K ,  and  the  grazing  angle  is  reasonably  shallow.  The  nominal  expansion 
parameter  2X/ k0  has  embedded  within  it  the  key  cancellation  generated  by  taking  the 
difference  X  -  k0y .  The  convergence  properties  related  to  this  nominal  expansion 

parameter  are  therefore  much  more  indicative  of  the  true  convergence  properties  for  PE 
expansion  (generated  by  the  FW  transformation  for  the  variable  density  problem)  than  are 
the  convergence  properties  that  apply  when  the  nominal  expansion  parameter  is  2 aXj k0 , 
(the  expansion  parameter  that  emerges  most  naturally  from  expression  (L.10)). 


This  leaves  us  with  the  obvious  question:  why  should  we  bother  with  the  generator  from 
equation  (L.8) 


at  all?  This  result  is  useful  because: 

■  Its  derivation  is  the  most  transparent  and  direct. 

■  The  result  is  nicely  compact  and  easy  to  work  with. 

o  It  is  useful  as  a  mnemonic 

o  When  calculating  the  endpoint  correction,  this  form  very  naturally 
generates  the  compact  result 


X  = 


f 

1  + 

v 


■A. 


For  these  reasons  we  will  henceforth  primarily  use  this  form,  being  careful  to  use  it 
correctly:  i.e.,  to  use  it  to  generate  an  expansion  in  2 y  and  X .  Both  these  parameters 
contribute  equally  to  the  power  counting,  and  we  must  be  careful  not  to  “bust  up”  orders 
of  X ;y. 


M  Appendix:  The  boundary  conditions  for  quasi-lst 
order  deterministic  variable  density  theory 


In  this  appendix,  we  derive  equation  (5.16),  the  boundary  conditions  for  detenninistic 
quasi-lst  order  acoustic  theory,  where  the  density  and  compressibility  jump  at  an 
interface.  This  calculation  nicely  illustrates  many  of  the  ideas  developed  in  Section  5, 
and  so  it  will  be  presented  in  great  detail.  We  operate  in  standard  2-dimensional  x  —  z 
space.  There  are  two  regions  labeled  /  and  II  separated  by  the  rough  surface  z  =  f{x). 
The  density  p  and  compressibility  K  are  constant  in  the  two  regions,  but  both  quantities 
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jump  along  the  interface  separating  the  two  regions.  The  positive  z  -axis  points  from 
Region  II  into  Region  /  .  The  starting  point  is  the  standard  scalar  parabolic  equation 

-M  =  Hz  (M.l) 


where  H  is  given  by  equation  (5.15) 


H  =  k0  +  A-2  k0y- 


A 

8  kl 


From  equation  (5.4),  we  have  the  definitions 


f 


A- 


jU 


1 

y=- 

2 


A - koJu  +  k0y 


2  K 

1-A 

V  K„y 

\-JL' 

v  Ay 


(In  the  2-dimensional  problem,  Vr  =  d/dz  .) 

P  =  Pa -Sp®(z-f) 
y=yn-Sy-©{z-f) 

<U  =  jUn-SjU©(z-f) 


We  also  have 

Sp  =  P„  ~  Pj 
where  Sy  =  yn  -  y,  . 
Sp  =  p„-p, 


(M.2) 


(M.3) 


(Note  that  these  definitions  are  a  little  more  general  than  the  ones  that  are  usually 
employed.  The  usual  special  case  will  be  considered  at  the  end  of  the  calculation.) 


We  will  evaluate  equation  (M.2),  and  the  S  -function-bifurcation  rules  outlined  in  Section 
5.2  will  be  used  as  needed  to  properly  interpret  the  results.  Then,  the  result  will  be 
substituted  into  the  basic  parabolic  equation  (M.  1).  To  obtain  the  boundary  conditions,  a 
pair  of  infinitesimal  transverse  integrations  of  the  type  described  in  Subsection  3.3.3  will 
be  applied  to  this  equation. 


Thus,  begin  by  differentiating  A  (as  always,  a  dot  means  the  downrange  derivative 
d/dx ): 


A  = 


Po^  T 


A 


P 


2  K 


-koJu  +  kj. 


(M.4) 


Now, 
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and  similarly 


p  =  -8p(-f)8(z-f)  =  f-Sp-8{z-f) 


y=f-8yS(z-f)  ;  fi  =  f  ■  Sp- S(z- f) . 


8(z-f)  of  course  bifurcates,  but  8p  ,  Sy  and  8/u  are  just  old-fashioned  constants,  and 

in  this  context  /  might  as  well  be.  Bifurcation  only  becomes  an  issue  when  we  multiply 
by  another  distribution,  and  so  at  this  point  the  issue  only  affects  the  term  in  (M.4)  that  is 
proportional  to  Sp  .  In  this  term,  we  have 


p-f5p 


-  +  - 


2  p  2pn 


and  so 


A= 

2k0  ■  2 


(  x+ 


S+{z~f)  | 

Pi, 


V  Pi  t'a  J 

- k0-8ju-f  8{z-  f)  +  k0-8y- f  8{z-  f ) 


Will  bifurcate 
later 


Will  bifurcate 
later 


(M.5) 


Taking  one  more  downrange  derivative: 


X=  p*f'5pv 

2  kn 


S+(z-f)  +  S~(z~f) 


2  Pi 


2  Pi 


Pof  ■  8P 
2  kn 


r  r 


vr  \ 


(-/ )  s,+ (z-.n+ (_-/ )  s'-  (z~f) 


2  Pi 


2  pi 


+  H'o  •  SP  +  K  ■  Sy) / (-/) 8\z -  f)  +  (-k0  Sju  +  k0-Sy)f  S(z-  f ) 
_-PofSPv 


Will  bifurcate 
later 


Will  bifurcate 
later 


(M.6) 


2  K 


+  Pof  -  Sp  v 
2  kn 


S+{z-f)_+5~{z~f) 


2  p; 


2  pi 


f S'+(z-f)  +  S'-(z-f ) 


V  2  P/  2  pn  j 

+  (k0-Sp-k0-Sy)f2S'{z-f)-(k0-8p-k0-8y)f  S(z-f). 


Will  bifurcate 
later 


Will  bifurcate 
later 
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Now  consider  Xx  ■  First  note  that  bifurcation  gives  8^,]±  x  =  8(")±x±  ■  Then  convert 
distributions  times  functions  of  the  transverse  coordinate  z  to  distributions  times 
“constants”  with  respect  to  z  that  depend  only  on  / (x)  using  equation  (3.54)  for  k  =  0 
or  k  —  1  (the  related  footnote  11  explicitly  evaluates  these  two  expressions): 


-  +  - 


2  ft  2  pi 


^tZ{Z)  ~  -  2  ^ tXi  (Z)  +  j  2  ^  tZiI  (Z) 

APi  A pn 

APi  A Pn 


5"  vrZ(A=4XvrX,(A  =  ^rZ,(/)-XivrX,(f) 


2  p) 
8' 

2  Pi 


2  tf 


2  Pi 


2  ft 


^tX(A  =  ^tX„  (A  =  XtVtZ„  (f)~V2rX„  (/)• 

^P II 


2  P)i 


2p2u 


Adding  the  last  two  equalities  gives  us 

^/+  s,+ 


v  2/?; + 2Pn  j 


Vr^(z) 


,nrv^,(/)+Arvri'„(/) 

A  Pi  A  p 'a 

'  ^\x,(f)+^\x,Af) 

A  Pi  A  Pu 


Similarly,  we  have 


f8'+  S,+  ^ 

+  J 


z(z)  =  ^pZAf)  +  ^yZAf) 


vva(/)4v*(/i' 


Now,  let  us  adopt  the  following  shorthand  notation: 


2fvrZ,(/)+Arv^„(/)=iy;^vr21(/) 

ZPi  zpn  Z  ±  p± 


VtX,  (/) + Ava  (/)  =  i^vrZl  (/) 

zpi  zpn  z  +  p± 


22v^,(/)+Arv^„(/)=iy;4v^1(/) 

zPi  zPn  z  ±  P± 


etc. 


Using  these  results  to  evaluate  equation  (M.6),  we  have 
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4ar= 


_  Pof  ■  Sp  . 


2kn 


+ML±y7 

2k0 

pj2  ■  Sp  v 
2t, 


^rXAf) 

_A  ±  y± 

\l,  £vr*±(/) 

A  ±  H± 

z  ±  y± 


+  (k0-dp-k(>  Sy)f 
-{k()-Sfi-k()-Sy)f 
-{k0-S/u-k0-Sy)f 


(M.7) 


In  the  first  three  terms  to  the  right  of  the  equals-sign  (=),  the  only  functions  of  the 
transverse  coordinate  z  are  the  S  and  S'  -functions,  and  so  we  can  operate  on  these  by 
the  transverse  derivative  VT  =  d/dz  .  This  gives  us 


Pof  ■  8P  y 
2k0  7 

Pof2'  ^P  y 

2k0  T 

Pof2  ■  Sp  v 

2kn 


_A  ±  y± 

\ZSf*r  Xt(f) 

z  ±  H± 

z  ±  H± 


— >- 


Po  f  ■  Sp 


2L 


Pof2  Sp 


2k0 
Pof2 -Sp 


2  L 


-2Hfvrxt(f) 

z  ±  H± 

ivrw  " 

j  X  -2  Vr/T±(/) 

_  ±  y± 

A  ±  y± 


(M.8) 


Result  (M.7)  as  modified  by  (M.8)  will  be  substituted  into  the  quasi-lst  order  parabolic 
equation  for  the  right-hand  traveling  (i.e.,  downrange)  field  % .  This  parabolic  equation 
comes  by  combining  (M.l)  and  (M.2)  to  form 

(M.9) 


-iA  =  hx+ty-2k()YX- 

ox 


8k: 


As  discussed  in  Section  3.3.4,  we  ignore  the  8 -function-like  singularity  associated  with 
dx/dx  primarily  because  it  is  of  higher  order  than  concerns  us  here  (it  involves  three 
downrange  derivatives  of  A ,  and  so  it  is  effectively  a  4th-order  term).  Furthermore,  this 
tenn  is  an  artifact  of  our  rather  artificial  quasi- 1st  order  theory. 

Since  the  lead  order  transverse  derivative  in  A  is  divided  by  2 k0 ,  we  multiply  the  wave 
equation  (M.9)  through  by  2 k0 ,  leaving  us  with 
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0  =  2  kM-^-z  + 

4k0 


terms  that  play  no  role  in 
the  boundary  value  problem 


From  definition  (M.3),  we  have 


2k0Az  =  Vr- 


VP) 


VTX-2klnx  +  2klyx . 

These  terms  are  not 
players  in  the  boundary 
value  problem 


(M.10) 


(M.l  1) 


Next,  multiply  (M.7)  as  modified  by  (M.8)  by  — 1/(4 k0) ,  and  substitute  the  result  as  well 
as  (M.l  1)  into  (M.10): 
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9  X  n2  ^rZ±(f) 

z  ±  H± 

|EfrV^(/) 

z  ±  y± 


r,(/) 


terms  that  play  no  role  in 
the  boundary  value  problem 


(M.12) 


Now  follow  the  transverse-integration  procedure  described  in  general  terms  throughout 
Section  3.3.3.  For  the  case  directly  analogous  to  the  present  situation  use  both  single 
integration  as  described  in  equation  (3.56)  and  double  integration  described  in  the 

f+e 

equation  immediately  preceding  it.  The  single  infinitesimal  integration  J  dz  •  •  •  of 

f-e 


equation  (M.  12)  gives  us 
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0 = — v,x,  -—v,Xu  Sy)f2^rZ‘ </)+ VrZ" (/) 

Pi  Pii  4 


Vrf  I 


+^{Sju-Sr)/Zl^+Zll(<f) 


(M.13) 


f+e  z 


Now,  let  us  perform  the  double  integral  J  dz  j  dz--- .  We  do  so  in  slightly  different 


f-e 


form:  as  a  sequence  of  an  indefinite  integration  followed  by  a  second  definite  integration 


f+e 


J  dz---.  First,  perform  the  indefinite  integration  and  then  multiply  by  p/pQ: 
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1'fr'L&PjrXAf) 

AP()  ± 


f-'L&padf) 

2Po  ± 


f+e 

and  now  the  infinitesimal  integral  J  dz---  gives  usdddd: 

f-e 


dddd  Note  the  term  in  X  that  was  proportional  to  8  did  not  contribute  to  the  boundary  conditions  for 
quasi- 1st  order  theory.  This  is  because  this  O  X'j  theory  is  a  bit  of  a  hybrid.  Once  we  introduce  an 

term  (let  alone  going  to  true  3rd  order  theory  with  the  ?i  -term),  then  the  term  proportional  to  8" 
will  contribute.  Historically,  contemplating  this  term  was  the  catalyst  that  made  the  author  consider  the 
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Equations  (M.13)  and  (M.14)  gives  us  the  boundary  conditions  on  J : 
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Now  let  us  make  the  standard  choices  for  the  reference  values  of  the  density  and 
compressibility.  This  follows  Figure  5.1  and  the  related  discussion  in  Subsection  5.3.2. 
Specifically,  in  order  to  guarantee  that  the  parabolic  equation  expansion  parameters  // 
and  y  be  less  than  1,  we  choose  for  the  maximum  values  for  the  reference 
compressibility  and  density.  In  our  prototypical  example  where  Region  /  is  the  ocean 
water  column  and  Region  II  is  the  sediment  in  the  ocean  bottom,  the  compressibility  in 
Region  /  is  the  largest  and  the  density  in  Region  II  is  the  bigger  value.  Thus,  the 
reference  compressibility  is  K0  =  KI  and  the  reference  density  is  p0  =  pn  .  This  gives  us 
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and  so  the  boundary  conditions  (M.  15)  become 
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issues  of  migrating  boundary  conditions  (Section  4.2.2)  and  thus  shortly  thereafter,  the  key  result  of  the 
whole  effort:  Bragg-scale  vorticity  (Section  6). 
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This  is  just  equation  (5.16),  the  boundary  conditions  for  deterministic  quasi- 1st  order 
acoustic  theory,  where  the  density  and  compressibility  jump  at  an  interface. 


N  Appendix:  A  closer  look  at  the  higher-order  range- 
dependent  terms  in  the  PE 


As  frequently  discussed  above,  the  Foldy-Wouthuysen  procedure  generates  a 
Hamiltonian  that  includes  a  unique  series  of  terms  explicitly  associated  with  the  range 
dependence.  It  was  noted  in  Section  4.4  that  the  members  of  this  series  of  terms  that  are 

proportional  to  X  correspond  to  the  phenomenon  of  “vacuum  polarization”  known  from 
quantum  mechanics  (because  this  type  of  tenn  involves  virtual  particle  pairs).  The 

lowest-order  member  of  this  class  of  terms  is  the  3rd-order  term  -Xj(%kl ) .  It  was  just 


evaluated  in  Appendix  M  (and  the  results  were  summarized  in  Subsection  5.3.2). 
Examining  equation  (J.24)  (or  equivalently  (5.9)),  we  see  that  there  are  also  two 

nominally  4th-order  terms  that  contain  X  : 
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In  Section  N.l,  we  estimate  the  magnitude  of  these  terms  for  typical  values  of  the 
environmental  parameters,  and  verify  that  they  are  indeed  getting  smaller.  The  form  of 

the  boundary  conditions  that  arise  from  the  -(Ay) Xj(%kl )  term  suggests  an  infinite 

series  encountered  previously.  In  Section  N.2  this  infinite  series  is  evaluated  in  closed 
form,  and  the  appropriate  limit  is  taken  to  examine  the  Dirichlet  =  0 )  and  Neumann 

( d%/dn  =  0  for  the  full  wave  problem,  but  this  changes  a  little  bit  for  the  parabolic 
equation)  boundary  conditions.  The  Neumann  boundary  conditions  for  the  parabolic 
equation  imply  the  existence  of  curvature-induced  boundary  waves. 


N.l  An  estimate  of  the  magnitude  of  the  4th-order  terms  in  the 
“vacuum  polarization”  series 


To  help  us  develop  a  feel  for  the  nature  of  the  convergence  of  the  series  of  terms  newly 
predicted  by  the  Foldy-Wouthuysen  procedure,  this  section  examines  the  magnitude  of 
the  4th  order  tenns  relative  to  that  of  the  3rd  order  tenn.  For  the  sake  of  this  comparison, 
we  take  out  a  common  factor  in  the  relevant  terms  (taken  from  equation  (J.24)  (or 
equivalently  (5.9)): 
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Now,  only  the  terms  in  the  parenthesis  need  to  be  compared.  To  within  a  constant,  these 
tenns  appear  in  a  Hamiltonian,  and  so  it  is  of  course  always  understood  that  they  are 
operating  on  the  wave  function  % . 


N.1.1  An  estimate  of  the  magnitude  of  the  [A, A)  -term 


Recall  that  j/l,  /L }  jk0  =  {  A ,/k0  )■  A  +  A- (A/k0) .  Let  us  evaluate  (A/k0)-  A%  first.  W  e  have 

A  from  equation  (M.7).  All  terms  contain  bifurcated  8{")  -functions  ( n  >  0  ).  Thus, 

A/k0  will  be  evaluated  in  the  half-spaces.  From  the  definition  (5.4)  and  equivalently 
(M.3),  A/k0  in  the  half-spaces  is  the  sum  of  the  operator 
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and  the  operator  —ju  +  y .  From  equation  (M.7),  the  former  operating  on  /(^generates 
8[ii)  functions  where  now  n  >  2  .  Now,  Su)  =  8"  integrates  to  zero  for  quasi-lst  order 
theory  (since  A  also  generates  the  leading  order  derivative  ),  so  in  this  context  we  can 

forget  about  all  these  terms.  This  leaves  us  with  —p  +  y  multiplying  Ax : 

(■ A/k0)-Az^{~p  +  y)Az . 


Since  A  contains  only  bifurcating  (71'1  -functions,  the  factor  (~p  +  y)  operators  just  like 
a  constant,  so  we  can  commute  A  and  (-//  +  y)  to  obtain: 
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Now,  let  us  flip  things  around  and  consider  A(A/k0)-  X  ■  Again  we  pick  up  two  kinds  of 
terms: 
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Combining  results  (N.3)  and  (N.4),  we  have 
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Note  that  the  factor  of  2  in  front  of  (-ju  +  y)z  occurs  because  both  halves  of  the 

commutator  contribute  to  this  term.  Equation  (N.5)  simply  gives  us  all  the  terms  in 
equation  (M.7)  (as  modified  by  equation  (M.8))  with 
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Terms  proportional  to  S{n)  ( n  >  2)  which 
do  not  contribute  to  quasi- 1st  theory 


(N.6) 


Note  that  in  quasi- 1st  order  theory,  we  throw  out  the  terms  proportional  to  S" . 

Equation  (N.  1)  tells  us  that  to  compare  the  4th-order  -contribution  with  that  from 

the  3ld-order  term  A ,  we  will  need  to  multiply  equation  (N.6)  by  ~y2 .  Now  we  find  that 
equation  (N.6)  contains  new  4th-order  tenns  that  relate  to  corresponding  ones  in  the  3rd- 
order  contribution  in  one  of  two  basic  ways: 


:  Note  that  when  differential  operators  are  to  the  right  of  8  -functions,  we  must  of  course  first  take  the 


derivatives  to  obtain,  for  example,  V"x(z)  ,  and  only  then  convert  from  a  function  of  z  to  a  constant  that 
is  nominally  evaluated  at  Z  =  f  . 
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■  the  operator  ([-3AT±]/[4^0])  (V2  / k\ )  has  been  inserted  to  operate  on  the  wave 
function  x±  ■>  or 

■  there  is  an  extra  factor  3(jU±-y±)  (this  factor  simply  multiplies  the  wave 
function  X±  or  its  transverse  derivative  V”x±  where  n  =  1,2). 

Now,  let  us  pick  credible  parameters  for  the  ocean  bottom.  We  will  use  values  for  sand, 
or  sandy  mud.  Then 

Kw,ter=KI=K0  =  3KI1=3KsanA 

2  P  water  Pi  —  P0  —  P  u  —  P Sand 


If  we  set  cwater  =  1500  m/s,  then  the  ratios  (N.7)  correspond  to  csand  ~  1840  m/s.  The  ratios 
(N.7)  also  give  us 
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Now,  consider  the  first  type  of  term  listed  above.  In  typical  problem  where  the  parabolic 
equation  is  used,  there  is  long-range  propagation  down  a  duct.  If  we  decompose  the 
wave  X±  traveling  down  the  duct  into  modes,  we  find  that  the  shallow  grazing  angle 
modes  dominate  downrange  propagation,  and  the  assumptions  behind  the  low-order 
parabolic  equation  are  easily  satisfied.  Modes  with  a  grazing  angle  on  the  order  of 
10  deg  is  typical,  and  for  such  modes,  V^,/A:+  brings  down  the  sine  of  the  grazing  angle 
squared:  sin2  ;  ~  sin2  (lOdeg)  ~  0.03  .  Multiplying  by  the  factor  given  by  the  first 

line  of  equation  (N.8),  we  find  that  in  both  half-spaces,  terms  of  the  first  sort  will  provide 
a  correction  on  the  order  of  l%-2%.  This  is  negligible,  and  we  are  very  safe  in  ignoring 
these  terms. 


Next,  consider  terms  of  the  second  type  listed  above.  We  have 
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(Equality  (N.6)  contains  a  subset  of  all  possible  permutations  on  n  =  0, 1, 2;m  =  0,1 .)  We 
will  throw  out  terms  that  have  an  overall  power  of  (i.e.,  a  power  of  y1  ,/u1  relative 


284 


to  the  corresponding  term  in  3rd-order  (A)  theory).  This  is  completely  acceptable, 
because  these  tenns  also  contain  two  downrange  derivatives  and  so  they  are  5th  order  by 
our  power  counting.  Recall  that  we  have  already  thrown  out  all  tenns  of  this  order. 

Now,  note  that  if  y,ju  — >  0,  then  % ,  VTx  and  V2T%  are  all  continuous.  Thus, 
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We  also  have,  pt  =  pu  +  O  (y) .  This  allows  us  to  make  the  substitution 
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and  pull  these  factors  out  of  (N.9).  Also  note  that  S  p  -  S+y+  integrates  just  like 
(ju_-y+)S ,  and  so  we  can  make  this  replacement  as  well  (cf.  the  discussions  a  little 

below  equation  (K.18)  and  surrounding  (K.21)).  Thus,  we  can  collapse  the  bifurcation 
and  pull  out  the  common  factors  to  get 
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In  other  words,  the  4th  order  contribution  is  picking  up  an  extra  factor  %(p_~  y+) 


relative  to  the  comparable  3rd-order  term.  Now,  from  equation  (N.8),  we  have 
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or  a  12.5%  correction.  This  is  not  too  bad. 


285 


Now,  let  us  perform  a  similar  analysis  on  the  term  proportional  to  4 y-  X . 

N.1 .2  An  estimate  of  the  magnitude  of  the  y-X -term 

Next,  let  us  estimate  the  magnitude  of  the  term  proportional  to  4  y-  X .  Again  using 

equation  equation  (M.7)  (as  modified  by  equation  (M.8))  to  substitute  for  X ,  we  have 
(with  y~  =  yu=  0) 
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Note  that  the  tenn  proportional  to  8"  will  not  contribute  to  quasi- 1st  order  theory,  and  so 
we  again  throw  it  out.  Also  once  again  throwing  out  terms  that  have  an  overall  power  of 
y\//’  (and  are  hence  5th  order  by  our  power  counting  since  these  terms  also  contain  two 
downrange  derivatives),  we  can  use  equation  (N.  10)  (and  p,  =  pn  +  O  ( y) )  to  get 
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With  /+  =  1/4  (from  equation  (N.8)),  this  is  a  50%  correction.  The  terms  are  getting 
smaller,  but  this  is  a  pretty  substantial  correction.  In  many  practical  instances,  it  may 
prove  necessary  to  use  higher-order  theory  to  correctly  evaluate  this  term,  so  in  Section 
N.2  we  will  attempt  to  extrapolate  the  result  to  infinite  orders,  and  examine  its 
implications. 


N.2  Extension  to  infinite  orders  and  the  implications  for  the 
Dirichlet  and  Neumann  Boundary  Conditions 


This  section  provides  a  preliminary  survey  of  one  of  the  possibilities  of  the  new 
formalism,  and  so  we  will  allow  ourselves  to  use  mathematical  reasoning  that  is  a  little 
bit  looser  than  that  generally  employed  elsewhere  in  this  study. 


As  in  Section  N.  1 .2,  we  have 
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Under  these  circumstances,  we  have  the  sequence  of  tenns: 

(1+2^' 


(N.13) 


which  contains  the  very  suggestive  factor  (1  +  2/+) .  This  looks  like  the  beginning  of  the 


familiar  series  (e.g.,  see  equation  (5.13)) 
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With  y  =  /+  ,  we  have  p()  =  pn  and  p  =  pI .  All  this  suggests  that  at  infinite  order,  the 
series  beginning  with  (N.13)  will  converge  to 
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In  quasi- 1st  order  theory,  the  lead  derivative  is  generated  by  A /kQ .  From  equation  (N.2), 
we  can  write  the  tenn  responsible  for  the  lead  derivative  as 
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which  is  independent  of  p0 ,  and  so  it  is  the  same  as  the  corresponding  °°  -order  in  y 

operator.  Thus,  it  is  consistent  to  generate  boundary  conditions  by  pairing  this 
differential  operator  with  the  (also  °°  -order  in  y)  boundary-condition-generating  term 
(N.  14).  Doing  so  effectively  leads  us  to  take  the  boundary  conditions  in  equation  (M.  16), 
identify  the  tenns  that  are  associated  with  X  by  noting  that  they  are  the  ones  proportional 
to  /  and  j  ,  and  multiplying  these  terms  by  pn  / p,  .  This  gives  us 
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where  we  have  used  fy  •  Pl/Pn  =  coKIpII  ■  P,/Pu  =  coKIpI  =  k]  .  To  get  the  Neumann  and 
Dirichlet  boundary  conditions,  we  will  let  both  pu  »  p,  and  Ku  K t ,  and  look  at  this 
problem  from  the  point  of  view  of  the  two  different  half-spaces. 

To  get  the  Neumann  boundary  conditions,  use  the  second  boundary  condition  in  equation 
(N.15)  as  viewed  from  the  point  of  view  of  the  field  in  Region  /  .  Thus,  set  pu  — >  and 
Kn  —>  0 .  Taking  these  limits  in  equation  (N.12)  gives  us 
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Now  the  lower  boundary  condition  in  (N.15)  becomes 
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Since  cn  is  finite  ( pu  and  Kn  —>  0  roughly  cancel),  kn  is  finite,  and  so 

VTZu  kn  ■  field  amplitude  ^ 

Pu  Pn 

Since  pt  is  not  large,  VTZ/  must  be  zero  to  O  (/,/2).  Also,  from  the  first  boundary 
condition,  %  =  Zi  +  O  (/,  f2 ) .  Multiplying  through  by  p, ,  this  leaves  us  with 

0  =  V^  X,  +  \[f^TX„+h,\  (N.17) 
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If  we  make  one  more  rather  sensible  approximation,  we  can  further  simplify  equation 
(N.  17).  Taking  the  air-water  interface  as  a  prototype  (Region  /  is  now  air  and  Region 
II  is  the  water),  we  note  that  cair  ~  300  m/s  and  as  usual  cwater  =  1500  m/s.  Conversely, 
we  could  go  from  water  (this  time  Region  I)  to  a  medium  where  sound  speed  is  much 
greater  than  water’s  -  for  example,  rock  (Region  II ).  Generalizing,  we  note  that  cu  is 
typically  pretty  large  in  the  denser  medium  (though  not  as  much  so  as  p  itself). 
Furthermore,  since  the  wave  number  is  given  by  k  =  co/c ,  c,  <  cn  also  implies  that 
kj  >  kn  .  Now,  let  us  use  this  insight  to  examine  the  term  in  (N.17)  proportional  to 
VTxn  ■  By  Fourier  decomposing  %n  ,  we  see  that  V7.  roughly  brings  down  a 

iknz  ~  i^kl-kj  cos2(0/;grazing)  =  -k, . 

We  can  typically  only  take  the  branch  that  decays,  so  VTZn  is  n°t  at  a  phase  minimum  as 

is  VtZi  ,  and  thus  f2^rZn  ~  f  'k/Zn  =  j  2k,Zi  ■  Also  note  that  /  ~  l/R ,  where  R  is  the 
radius  of  curvature  of  the  interface.  Let  us  now  compare  the  two  terms  in  (N.17): 
f2VrZ„  and  fZi-  This  amounts  to  comparing  f2kI  and  l/R  ,  or  equivalently  j  and 
1 /  (kjR) ffff.  Now,  keeping  the  first  order  in  the  radius  of  curvature  O  (l/  (ktR))  and 

dropping  the  2nd  order  in  the  slope  ( O  [f2  j )  is  exactly  the  kind  of  approximation  that  we 

make  when  we  choose  to  model  the  scattering  amplitude  using  the  small  slope 
approximation  rather  than  the  composite  model  (Reference  [73],  Section  II).  This  is 

often  a  reasonable  thing  to  do,  and  we  do  it  here.  Thus,  the  0(/2)  term  in  equation 
(N.17)  may  be  considered  small,  and  we  throw  it  out. 

Making  this  last  substitution,  we  have  for  the  Neumann  boundary  condition 

VtZi=~Zi-  (N-!8) 

Equation  (N.  18),  the  result  for  the  Neumann  boundary  condition,  leads  to  curvature- 
induced  boundary  waves  along  concave  surfaces.  Note  that  since  the  boundary  waves  are 
confined  to  concave  surfaces,  they  cannot  lead  to  an  energy  flux  that  propagates  down  a 

ffff  Note  that  the  rescaling  suggested  by  equation  (N.14):  A/8 k^  =>  —  (^  A/^kl  j  (pu  / pL )  =  —{^A/% kj  j 

takes  the  reference  wave  number  k0  and  replaces  it  with  the  wave  number  in  Region  /  ,  kj .  Recalling  the 
virtual-fluctuation  interpretation  of  Appendix  C.2.6,  this  suggests  the  virtual  fluctuations  are  actually  of 
magnitude  Sx  ~  ijlkj  .  This  in  turn  suggests  that  the  size  cutoff  for  determining 

f  =  l/[radius  of  curvature]  should  be  taken  as  the  wave  number  in  Region  I ,  kI . 
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rough  surface.  Thus,  the  effect  associated  with  the  Neumann  boundary  conditions  is 
small  compared  to  the  analogous  effect  along  a  2-fluid  interface.  In  the  latter  case,  there 
is  always  a  concave  surface  available,  and  so  boundary  waves  can  always  be  present  and 
can  even  propagate  downrange.  (As  discussed  in  the  text  (Section  5.3.3),  something 
similar  can  occur  even  for  the  Neumann  boundary  conditions  along  a  bossed  surface, 
where  it  is  possible  for  the  surface  to  be  everywhere  concave.  However,  the  bossed 
surface  lies  outside  the  purview  of  the  parabolic  equation.) 


Next,  let  us  consider  the  Dirichlet  boundary  conditions.  Now,  we  must  view  the  first 
boundary  condition  in  equation  (N.  15)  from  the  point  of  view  of  the  field  in  Region  II 
and  let  Kt  —>  °°  while  pI  — >  0  .  The  first  boundary  condition  in  (N.15)  now  becomes 


Xu  =Xi~ 
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Now,  at  least  in  the  higher-order  terms,  Xi ,  V \Xi  0  •  (To  convince  yourself  of  this 
statement  for  the  latter  quantity,  Fourier  decompose  and  look  at  the  plane  wave  solutions 
on  a  flat  interface.)  This  leaves  us  with  two  terms  that  are  proportional  to  j  .  They 
contain  the  quantities  if/,) Xu  anc*  (^/J^rXi  ■  These  are  both  finite  since  Xu  an4 

V2rXi  both  go  to  zero  as  p,j  p„  .  Therefore,  (as  with  the  Neuman  boundary  conditions), 
we  can  once  again  drop  the  slope-squared  terms  to  get 

f  „ 

Xu  —  Xi  +  ^  tX  ii  ■ 

Set  the  field  on  the  far  side  to  zero  (assert  that  the  wave  is  incoming  on  side  II  and  that 
nothing  bleeds  through  to  the  far  side  —  i.e.,  side  /  ).  This  gives  us 

Zii  =  WVtZi1'  (N'19) 

This  time,  the  O  (/ j  -term  is  insignificant  for  two  reasons.  In  the  first  place,  a  boundary 

condition  of  the  form  (N.  19)  represents  a  small  displacement  of  the  surface,  and  this  will 
not  affect  the  scattered  wave  very  much.  Even  more  importantly,  as  discussed  above,  we 
expect  the  wave  number  in  Region  /  ,  kj  to  be  large  on  the  scales  that  matter  for  the  field 

in  Region  II ,  and  so  1  /k]  is  small.  Thus,  we  drop  the  O  (/)  -term,  and  assert  that  the 

Dirichlet  boundary  conditions  are  not  significantly  influenced  by  the  curvature-induced 
terms  introduced  by  the  Foldy-Wouthuysen  transformation.  The  Dirichlet  boundary 
condition  is  thus  unchanged:  Xn  =  0  •  This  is  no  surprise,  since  the  new  terms  introduced 
by  the  Foldy-Wouthuysen  transformation  are  connected  to  phenomena  that  require 
renormalization  (e.g.,  vacuum  polarization  is  associated  with  renormalized  Quantum 
Electro-Dynamics),  and  Orris  and  Dashen  have  shown  that  the  Dirichlet  boundary 
conditions  only  require  renormalization  at  very  high  orders119,120121.  (As  discussed,  in 
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Section  4.4  and  in  Appendix  O,  renormalization  for  the  parabolic  equation  reduces  to  the 
simple  introduction  of  the  cutoff  k0 ,  or  as  we  have  just  discovered  perhaps  kr .) 


The  results  of  this  appendix  indicate  that  the  parabolic  equation  may  be  a  good  way  to 
study  the  explicit  effects  of  slope  and  curvature  on  scattered  fields.  This  is  related  to  the 
fact  that  a  good  parabolic  equation  conserves  energy,  and  energy  conservation  helps  to 
pacify  some  of  the  pathologies  that  perturbative  field  theories  encounter  with  multiscale 
rough  surfaces.  Specifically,  the  failure  of  perturbation  theories  to  properly  incorporate 
curvature-induced  boundary  waves  is  a  cause  behind  the  failure  of  these  theories  to 
conserve  energy.  This  will  be  discussed  further  in  Appendix  O. 


O  Appendix:  Curvature-induced  boundary  waves  in  the 
context  of  the  development  of  modern  rough-surface 
scattering  theory 

This  Appendix  reprises  developments  in  acoustic  rough-surface  scattering  theory  since 
the  late  1980’s,  and  points  to  remaining  issues  that  may  be  addressed  using  the  “vacuum 
polarization”  series  of  tenns  introduced  into  the  parabolic  equation  by  the  Foldy- 
Wouthuysen  transformation. 


0.1  The  status  quo  in  the  late  1980s 

In  the  late  1980’s,  there  were  a  number  of  competing  approaches  for  calculating  acoustic 
scattering  from  rough  surface. 

For  some  very  idealized  problems,  exact  solutions  were  available.  Sometimes,  “brute 
force”  numerical  calculations  were  used.  These  generally  made  use  of  either  finite 
difference  or  finite  element  approaches.  These  approaches  are  practical  for  1- 
dimensional  and  occasionally  even  2-dimensional  surfaces  with  mild  roughness  (e.g., 
calm  sea  surface),  and  occasionally  for  1 -dimensional  “agitated”  surfaces  (e.g.,  heavy 
seas).  For  agitated  surfaces,  the  size  of  the  scattering  surface  had  to  be  limited. 
Furthermore,  to  get  stochastic  answers,  many  realizations  had  to  be  considered.  The 
calculations  were  quite  demanding  since  many  scales  are  simultaneously  involved  in  such 
scattering  problems.  As  a  consequence,  for  truly  rough  surfaces,  these  brute  force 
approaches  were  at  best  research  tools,  but  they  were  not  particularly  useful  as  general 
algorithms. 
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There  were  also  perturbative  approaches.  These  typically  concerned  the  scattering 
amplitudegggg,  which  is  in  principle  related  to  the  scattered  field  everywhere  via  a  2- 
dimensional  Fourier  transform.  In  practice,  the  scattering  amplitude  was  typically  used  to 
calculate  the  far  field,  which  is  simply  proportional  to  it,  or  the  far  field  scattering  cross- 
section,  which  is  proportional  to  the  magnitude  squared  of  the  scattering  amplitude.  For 
quasi-planar  rough  surfaces,  the  quantity  of  interest  was  the  stochastic  scattering  cross- 
section  per  unit  area,  which  measured  in  decibels,  becomes  the  scattering  strength  that  is 
generally  substituted  into  the  sonar  equation.  Once  the  quasi-planar  rough  surface  is 
introduced,  one  generally  drops  the  specular  8  -function,  and  only  examines  the 
incoherent  wave.  (The  coherent  component  has  a  different  spreading  center  than  does  the 
incoherent  wave,  and  so  extreme  care  is  needed  if  the  specular  component  is  retained  in 
the  far-field  formalism.) 

As  of  the  early  1980’s,  there  were  two  widely  used  approximations  to  calculate  the 
scattering  amplitude.  The  Kirchhoff  approximation  dates  from  the  19th  century,  and  it 
was  used  in  the  high  frequency  limit  to  calculate  the  scattering  from  gently  undulating 
(i.e.,  slowly  varying  on  the  scale  of  a  wavelength)  surfaces.  For  wavelength  scale 
scattering  (also  known  as  Bragg  scattering),  perturbation  theory  developed  by 
Rayleigh  ,  Rice  “  and  Watennan  “  was  used.  Both  approaches  remained  the  subject  of 
much  tinkering  and  controversy.  In  fact,  it  was  not  until  the  mid  1980’s  that  the 
relationship  between  the  Kirchhoff  and  perturbation  approximations  was  established 
independently  by  Berman  and  Perkins  ,  and  by  Holliday  . 

A  pair  of  useful  new  approximations  applicable  to  acoustic  scattering  from  the  ocean 
surface  (i.e.,  Dirichlet  boundary  conditions)  was  developed  between  the  early  1960’s  and 
the  mid  1980’s.  Various  groups  in  Russia  “  ’  and  S.  McDaniel  (with  collaborators)  in 
the  United  States  ~  ’  developed  the  composite  model  (also  known  as  the  2-scale 
model),  which  glued  physical  and  geometrical  acoustics  together,  forcing  the  spectrum  of 
the  latter  regime  to  terminate  at  some  fixed,  empirically  chosen  maximum  wave  number 
(typically  about  the  Bragg  wave  number).  The  mid  1980’s  also  saw  the  emergence  of 
a  new  approximation  that  remains  extremely  useful  to  this  day:  the  small  slope 
approximation  by  Voronovich131'1’2.  Its  properties  have  since  been  firmly  established  by 
Thorsos  and  Broschat  ’  ,  and  it  has  been  extended  to  a  wide  variety  of  fields  and 

boundary  conditions  ’  ’  ’  ’  .  The  small  slope  approximation  unifies  the  Kirchhoff 

approximation  and  perturbation  theory  by  reducing  to  each  in  the  proper  limit.  The 
Kirchhoff  approximation  and  the  composite  model  have  largely  been  replaced  by  the 
small-slope  approximation,  and  this  approximation  has  become  the  new  standard  method 
for  calculating  the  scattering  amplitude  and  cross-section  for  a  quasi-planar  rough 
surface.  (Perturbation  theory  remains  in  widespread  use  as  well,  because  it  does  not 
contain  an  integration  and  is  consequently  simpler  to  evaluate  than  the  small  slope  result.) 


gggg  An  exception  would  be  the  perturbative  normal  mode  approach  of  Kuperman  and  Schmidt  mentioned 
in  several  places  in  this  report,  including  for  example  in  Table  6.1. 
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0.2  Towards  a  new  scattering  theory 


The  various  techniques  available  by  the  late- 1 980 ’s  all  had  some  qualitative  or  formal 
validity  criteria,  but  there  was  nothing  to  provide  an  overall  context  for  the  various 
approaches.  This  void  helped  generate  a  widespread  distrust  of  the  results  of  rough 
surface  scattering  theory.  When  in  the  mid- 1980’s  it  became  clear  that  theory  and 
experiment  were  not  in  agreement  (sometimes  they  differed  by  a  factor  of  100),  it  was 
claimed  by  many  that  the  rough-surface  scattering  theories  were  inadequate,  and  that  this 
mechanism  could  still  not  be  ruled  out.  Since  perturbation  theory  works  well  in 
predicting  HH  electromagnetic  scatteringhhhh  when  the  same  air-sea  interface  is  impacted 
from  the  other  side139,  it  should  have  been  clear  all  along  that  scattering  theory  was  not 
the  problem,  and  indeed  it  was  eventually  established  that  the  true  cause  of  the  anomaly 
was  scattering  from  near-surface  bubble  clouds140.  Nevertheless  the  controversy  of  the 
late-1980’s  sparked  new  developments  in  scattering  theory. 

For  example,  this  state  of  affairs  prompted  R.  Dashen,  et.  al.  to  begin  a  wide-ranging 

I'X 

examination  of  the  problem  of  scattering  from  the  rough  air-sea  interface  .  Even  before 
this  effort  began,  it  had  already  become  clear  to  Dashen  that  there  was  a  strong  need  to 
relate  the  existing  approaches  to  one  another  and  to  a  clear  physical  intuition  (including 
an  inventory  of  which  physical  processes  the  various  approximations  included  and  which 
they  did  not).  This  need  would  be  addressed  by  developing  an  overarching  “theory  of 
theories”  that  would  generate  a  taxonomy  for  the  various  scattering  models.  This  matrix 
of  theories  would  then  be  used  to  fill  in  the  gaps  in  our  knowledge  of  the  problem.  In 
particular,  we  would  be  able  to  apply  a  given  approach  to  a  wide  variety  of  fields  and 
boundary  conditions,  to  develop  new  approximations,  and  most  importantly  to  conduct  a 
systematic  search  for  the  missing  physics. 

The  basic  technique  was  proposed  by  Dashen,  and  I  had  the  opportunity  to  participate 
with  him  in  its  development  and  application  to  wide  variety  of  situations.  At  the  core  of 
the  new  approach  was  the  realization  that  for  modem  20th  century  field  theories,  a  clear 
hierarchy  of  possible  approximate  solutions  falls  out  naturally.  To  this  was  added  the 
insight  that  the  difference  between  fields  that  were  discovered  in  the  19th  century  and 
those  that  were  discovered  in  the  20th  century  is  the  approach  used  to  describe  them  and 


hhhh  An  HH  polarized  electromagnetic  wave  scattering  from  a  perfect  conductor  obeys  the  same  Dirichlet 
boundary  condition  that  as  that  obeyed  by  an  acoustic  field  incident  on  the  air-sea  interface  from  below. 
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not  the  underlying  physics1111.  During  the  19th  century,  fields  were  traditionally  described 
using  a  “tour  de  force”  of  vector  calculus  and  formal  mathematical  manipulation,  while  in 
the  20th  century  field  theories  have  often  been  “bootstrapped”  from  symmetry  arguments. 
Thus,  the  goal  of  our  effort  was  to  rederive  classical  field  theory  using  modem 
techniques. 

To  be  specific,  the  expression  for  the  scattering  amplitude  was  derived  using  a  symmetry 
principle:  Noether’s  theorem.  Roughly  speaking,  when  a  symmetry  leads  to  a  conserved 
current,  Noether’s  theorem  associates  a  source  term  for  the  current  with  the  symmetry 
breaking  operation.  Time  reversal  invariance  was  the  symmetry  invoked,  and  the 
conserved  current  was  defined  by  Lorentz’s  lemma.  The  symmetry  was  broken  by 
stipulating  that  the  reciprocal  problem  scatters  from  a  perturbed  surface.  The  associated 
“source”  in  the  current-conservation  equation  was  the  change  in  the  scattering  amplitude 
associated  with  the  perturbation  of  the  scattering  surface.  The  result  was  a  new  2-scale 
theory  (i.e.,  composite  model)  that  was  exact  with  respect  to  the  reference  problem  and 
first  order  with  respect  to  the  perturbation  of  the  scattering  surface™.  The  result  was 
obtained  for  acoustic  field135’141,  electromagnetic141  fields,  and  eventually  also  for 
elastodynamic  fields  . 

The  reciprocal  2-scale  theory  was  then  used  to  spawn  a  family  of  approximations.  An 
infinitesimal  transfonnation  of  this  result  leads  to  an  exact  new  manifestly  reciprocal 
expression  for  the  scattering  amplitude  ’  ’  ’  .  Using  this  result,  it  is  almost  trivial  to 

obtain  the  lowest  order  (manifestly  reciprocal)  small  slope  approximation  for  a  wide 
variety  of  fields  and  boundary  conditions  ’  ’  .It  was  also  possible  to  derive  an 

optimal  “local”  approximation — the  “small  curvature”  approximation  .  The  formalism 
also  led  to  shortcuts  to  higher-order  perturbation  theory,  cleared  up  the  relationship 
between  the  various  scattering  theories  mentioned  above,  and  provided  a  reciprocal 
scattering  amplitude  for  collections  of  scatterers  near  a  rough  surface  (where  the 
distribution  of  scatterers  as  a  function  of  the  depth  follows  the  surface  as  it  is 
perturbed)  . 


1111  Concerning  quantum  fields  versus  classical  fields,  the  measurement  problem  and  the  associated  issues 
related  to  the  collapsing  wave  packet  is  apparently  an  exception  (although  if  the  proponents  of  decoherence 
generated  internally  to  field  theory  are  correct,  then  even  this  distinction  blurs),  but  this  issue  is  not  (at 
present)  addressed  within  field  theory.  What  is  addressed  is  the  gradual  development  of  the  quantum  field 
as  a  function  of  time  and  space,  and  this  is  entirely  analogous  to  the  development  of  a  classical  field. 
(Quantum  fields  also  often  contain  rich  algebraic  structures  that  are  not  typically  found  in  classical  fields, 
but  this  distinction  is  also  irrelevant  to  the  discussion  here.) 

mi  j|lc  eariier  composite  (or  2-scale)  model  developed  in  Russia  and  the  United  States  between  1960’s  and 
the  early-1980’s  will  henceforth  be  called  the  “old”  composite  model. 
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Since  it  is  exact  with  respect  to  the  reference  problem,  the  new  2-scale  theory  models 
non-local  effects  associated  with  the  reference  surface.  If  the  perturbation  is  taken  to  be 
the  Bragg-scale  features  and  the  reference  surface  is  taken  to  be  the  “big”  waves,  then  the 
non-local  effects  include  diffraction  as  well  as  effects  that  in  the  geometrical  acoustics 
limit  become  shadowing  and  multiple  scattering.  This  new  2-scale  theory  was  used  by  a 
group  at  Dynamics  Technology142  to  model  radar  scattering.  The  related  report  contained 
the  perceptive  observation  that  the  principal  advantage  of  the  reciprocal  2-scale  result  is 
that  it  properly  models  all  the  paths  involved  in  a  scenario  where  the  field  scatters  twice 
from  the  surface  in  the  following  manner:  once  with  the  wavelength  scales  somewhat 
larger  than  a  wavelength  (via  diffraction  or  locally-specular  scattering)  and  the  other  time 
via  Bragg  scattering.  In  reference  [73],  where  the  contribution  associated  with  the 
reference  surface  was  obtained  to  2nd  order,  it  was  recognized  that  these  (non-local) 
scattering  channels  cover  the  dominant  parts  of  4th  order  perturbation  theory  (for  the 
stochastic  cross  section),  and  so  the  2-scale  result  was  used  to  get  a  serviceable 
approximation  to  4th  order  perturbation  theory  (for  the  Dirichlet  boundary  conditions). 

The  reciprocal  scattering  theory  was  used  in  several  additional  contexts.  Dashen  et  al. 
used  it  to  create  a  new  theory  of  scattering  from  finite  objects143.  Small  slope 
approximations  developed  using  the  reciprocal  scattering  formalism32,138  are  currently 
being  used  to  model  scattering  from  elastic  ocean  bottoms144,145  and  from  the  air-sea 
interface145. 

As  the  reciprocal  theory  was  developed,  other  approaches  were  being  successfully 
implemented  elsewhere.  Thorsos  has  worked  extensively  with  4th  order  perturbation 
theory146,147.  M.  Milder  designed  an  operator  expansion  method  that  is  both  elegant  and 
has  proven  very  useful148,149,150.  Voronovich  has  developed  a  non-local  version  of  the 
small  slope  approximation1^1.  Numerical  studies  by  Thorsos  and  collaborators  have 
produced  an  “empirical  taxonomy”  of  approximations  based  on  their  observed  range  of 
validity  as  a  function  of  parameters  characterizing  the  scattering  surface134,150,152. 


0.3  Shortcomings  inherent  to  perturbative  models  of  rough- 
surface  scattering 

Despite  all  this  progress,  the  theoretical  approach  has  remained  incomplete.  The  issues 
involved  have  yet  to  play  any  direct  role  in  practical  applications  in  underwater  acoustics, 
and  so  the  topic  has  remained  fairly  obscure.  However,  it  is  of  physical  and 
mathematical  interest,  and  as  such  one  suspects  that  it  will  find  its  way  into  applications 
once  the  related  phenomena  are  understood.  The  physics  that  is  obscured  by  the  various 
perturbative  approaches  in  wide  use  today  manifests  itself  in  four  ways: 

•  Energy  is  not  conserved  on  an  order-by-order  basis. 

•  Curvature-induced  boundary  wave  phenomena  and  the  somewhat  related  effects 
that  depend  explicitly  on  the  tilt  are  relegated  to  high  orders  where  they  become 
very  illusive  multiple  scattering  effects. 
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•  In  principle,  at  some  point,  the  need  arises  to  renonnalize  these  theories.  For 
standard  perturbation  theory,  this  is  typically  at  very  high  orders119'120  ”121,  but  the 
issue  can  appear  already  at  first  order  (e.g.,  the  small  curvature  or  arctangent 
formulas  derived  in  reference  [32]).  If  uncorrected,  the  need  for  renormalization 
leads  to  infinities  for  fractal  surfaces  and  unphysical  sensitivity  to  scales  far  below 
a  wavelength  for  self-affine  (cutoff  fractal)  surfaces  as  well  as  for  any  other 
surfaces  whose  spectra  do  not  rapidly  decay  to  insignificant  levels  at  scales  below 
the  Bragg  scale. 

•  On  the  other  hand,  those  orders  of  perturbative  scattering  theories  that  do  not  raise 
renormalization  issues  fail  to  properly  account  for  scattering  from  surface  features 
that  are  smaller  than  the  Bragg  scale,  but  could  still  be  physically  relevant 
according  to  the  uncertainty  principle.  These  features  constitute  a  third  scale  that 
comes  in  addition  to  the  two  scales  usually  discussed1 19:  the  Bragg  scale  and  the 
“big”  scales  (>  wavelength). 

0.3.1  Energy  non-conservation  and  boundary  waves 

A  number  of  researchers  have  recognized  the  connection  between  the  first  two 
manifestations  of  the  phenomenon:  energy  non-conservation  and  curvature-induced 
boundary  waves.  Thorsos  has  based  his  argument  on  the  insight  that  for  1st  order 
perturbation  theory  corresponding  to  the  Neumann  boundary  condition,  as  the  grazing 
angle  goes  to  zero,  the  incoming  flux  goes  to  zero,  but  the  scattering  cross-section  does 
not.  This  violates  energy  conservation,  and  Thorsos  has  identified  boundary  waves  as  the 
culprits  behind  this  anomaly  .  Soto-Crespo  et  al.  wrote  about  the  same  phenomenon  in 
the  context  of  electromagnetic  theory.  They  attributed  the  failure  of  energy  conservation 
to  what  they  called  polaritons,  which  are  the  curvature-induced  boundary  waves.  Tang 
and  Frisk  have  also  noted  the  existence  of  curvature-induced  boundary  waves.  On  a 
track  parallel  to,  but  outside  the  framework  of  modern  scattering  theory,  Biot  and  Tolstoy 
have  extensively  studied  curvature-induced  boundary  waves  along  bossed  surfaces.  This 
work  is  briefly  examined  at  the  end  of  Subsection  5.3.3. 


0.3.2  The  third  scattering  scale  and  renormalization 

Some  time  ago,  Dashen  and  Orris  made  the  connection  between  the  third  and  fourth 
manifestations  of  this  phenomenon1 19120121  in  this  regard,  it  is  worth  quoting  the 
abstract  from  Dashen’ s  talk  at  the  120th  meeting  of  the  Acoustical  Society  of  America  in 
1990* 19: 

“Theoretical  work  on  scattering  from  rough  surfaces  is  often  plagued  by  an  extreme 
range  of  scales  of  surface  roughness.  In  particular,  a  wide  range  of  scales  can  make 
numerical  calculations  impractical.  Roughness  on  scales  longer  than  a  wavelength  can  be 
handled  by  an  improved  composite  model.  Scales  smaller  than  a  wavelength  can,  in 
certain  circumstances,  cause  serious  trouble.  The  cross  section  can  depend  explicitly  on 
the  cutoff  at  high  wave  numbers.” 
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In  this  talk,  Dashen  primarily  referred  to  perturbation  theory,  but  the  issue  of  a  cutoff  also 
appears  in  the  implementations  of  Dashen’ s  small  curvature  formula32  (also  known  as  the 
arctangent  fonnula).  This  too  was  an  early  motivating  factor  for  Dashen’s  attention  to 
the  third  scattering  scale.  Among  other  things,  he  was  looking  for  a  proper  way  to  cutoff 
approximations,  which,  like  the  small  curvature  formula,  addressed  the  physics  of  the 
third  scattering  scale.  (It  should  be  noted  that  formally  the  cutoff  issue  for  the  small 
curvature  fonnula  was  related  to  the  slope  /  and  not  the  curvature  f .  This  is  ultimately 
the  result  of  a  hidden  integration  by  parts,  and  the  1st  order  contribution  does  indeed 
originate  from  the  curvature  f  (see  the  discussion  in  Appendix  A  of  reference  [32],  and 
in  particular  equation  A22  in  the  reference).) 

Before  proceeding,  it  should  also  be  noted  that  Voronovich  and  Zavorotny  have  recently 
introduced  a  small  slope  approximation  that  includes  the  curvature  "  .  In  this  case,  the 
curvature  is  cutoff  at  the  upper  limit  of  the  “big”- wave  scale.  The  physics  involved  is 
diffraction  and  similar  issues  in  physical  acoustics,  but  the  third-scale  physics  with  its 
related  renormalization  issues  does  not  come  up  in  this  work.  In  this  sense,  it  is  a  little 
like  the  old  composite  model  ’  ’  ’  ,  where  the  cutoff  was  imposed  on  the  slope  at 

the  upper  end  of  the  “big”-wave  scale  (reference  [  ]  includes  a  look  at  the  process  of 
cutting  off  the  “old”  composite  model). 

The  connection  between  the  need  for  renonnalization  and  the  sensitivity  to  small  scales  is 
an  intimate  one.  As  noted  in  the  text  (see  Section  4.4),  it  comes  from  the  fact  that  the 
curvature  f  and  the  slope  j  are  “broad-spectrum”  parameters  that  depend  on  all  scales. 
They  are  especially  sensitive  to  the  smallest  scales.  This  means  that  terms  proportional  to 
these  parameters  will  not  miss  the  bona  fide  physical  effects  associated  with  the  sub¬ 
wavelength  scale,  but  it  also  means  that  these  terms  will  have  trouble  accounting  for  the 
mechanism  that  imposes  the  uncertainty  principle  on  a  classical  field.  The  real  effects  are 
not  clearly  delineated  from  the  artifacts.  This  is  why  third-scale  scattering  phenomena 
are  invariably  tied  up  with  the  issue  of  renormalization. 

The  renonnalization  of  full-wave  perturbative  theories  is  generally  a  cumbersome  process 
involving  the  repeated  resummation  of  the  perturbative  series.  As  the  orders  are  remixed, 
cancellations  that  used  to  occur  between  orders  now  occur  within  a  given  order.  Most 
noticeably,  any  infinities  now  properly  cancel  each  other  out  on  an  order-by-order  basis. 
Rather  than  deal  with  all  the  complications  related  to  renormalization,  one  typically 
throws  out  the  legitimate  physical  effects  associated  with  subwavelength  scales.  To 
change  this,  we  will  need  a  method  to  bring  curvature/tilt-induced  third-scale  physics 
(including  especially  boundary  wave  phenomena)  down  to  accessibly  low  orders,  and  we 
will  also  need  a  relatively  workable  solution  to  the  renormalization  issue. 


0.4  The  parabolic  equation  is  a  good  way  to  examine  the  issues 
left  unresolved  by  perturbative  theories 


297 


The  parabolic  equation  generated  by  the  Foldy-Wouthuysen  transformation  manifestly 
accomplishes  the  first  goal  listed  in  the  last  paragraph  of  Subsection  0.3.2  just  above. 

We  have  contributions  that  are  1st  order  in  the  curvature  ( O  (/)  )  and  2nd  order  in  the 

slope  ( O  [f2 ) ).  These  orders  are  certainly  low.  As  discussed  in  Section  4.4,  there  also 

exists  a  rather  simple  workaround  to  the  issue  of  renormalization.  The  surface  spectrum 
is  a  function  of  k ,  a  wavevector  in  the  average  plane  of  the  quasi-planar  surface  (i.e.,  the 

x-y  plane).  (Fora  1 -dimensional  surface,  k  — >  kx .)  Fourier  decomposing  the  field  into 

plane-wave  components,  a  given  component  of  the  surface  will  more  or  less  excite  the 
component  of  the  field  whose  wave  vector  projects  to  k  on  the  x-y  plane.  This  projected 
wavevector  can  have  any  magnitude  between  0  and  k0 .  Thus,  we  are  well  justified  in 
mimicking  quantum  mechanics  and  choosing  our  upper  spectral  cutoff  to  be  k0 .  From 
the  uncertainty  principle  AkArx_y  >  /  (  rx_y  is  a  distance  in  the  x-y  plane),  this  gives  us 


A  r  >  =  — 

x-y  2  Ak  2  kn 


A . 

4n  ' 


The  field  cannot  resolve  features  smaller  than  about  /2  wavelength,  and  the  third  scale 


lies  roughly  between  \  and  /,/4;r  . 


Using  the  /,/4;r  cutoff  to  calculate  the  renormalized  values  of  /  and  /2  is  adequate  as 

a  first  guess.  Replacing  these  parameters  with  their  renormalized  values  is  an  example  of 
a  reduction  formula  (see  reference  [76],  p.  438).  It  is  completely  analogous  to  the 
“guess”  for  a  cutoff  on  the  energy  spectrum  that  Bethe  used  in  calculating  the  Lamb  shift 

(except  recall  that  he  already  effectively  had  a  problem  with  ^/2^  ;  see  Section  4.4). 

(Bethe’s  calculation  can  be  found  in  reference  [76],  p.  593.)  This  issue  arose  in  the 
context  of  a  low  energy  approximation  that  depends  on  assumptions  that  are  similar  to  the 
low-grazing-angle  small-backscatter  approximation  implicit  in  the  parabolic  equation. 
Bethe  chose  the  rest  mass  m0  as  his  high-energy  cutoff.  Our  cutoff  k0  plays  the  same 

role  in  the  Helmholtz  equation  as  the  rest  mass  m0  plays  in  the  Klein-Gordon  equation, 
and  k0  also  plays  the  same  role  in  the  parabolic  equation  as  m0  does  in  the  Schrodinger 
equation.  Thus,  Bethe’s  choice  of  m0  as  a  cutoff  is  completely  analogous  to  our  choice 
of  k0 .  Bethe  obtained  good  results  with  his  technique.  However,  it  is  good  to  keep  in 
mind  that  it  is  ultimately  better  to  use  experimental  values  for  the  effective  curvature 
/effective  and  effective  slope  squared  /e2fective ,  or  better  yet,  to  use  the  full  power  of  field 
theory  to  include  higher-order  “radiative”  corrections  (see  reference  [76],  p.  593-594). 

For  now,  we  will  be  taking  the  easy  way  out,  and  impose  the  cutoff  k0 . 


Note  that  if  we  were  operating  in  full- wave  theory,  we  would  also  have  to  allow  for 
backscatter  and  a  given  component  of  the  surface  could  also  excite  a  backward  traveling 
wave  with  wave  number  —k  .  This  leads  to  some  contribution  for  the  -k0  to  0  wave 
number  range,  but  this  contribution  has  to  weighted  differently  than  the  positive 
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contribution.  There  is  no  obvious  way  to  do  so,  and  so  getting  the  needed  reduction 
formulas  is  no  longer  as  easy  as  choosing  a  single  value  for  the  cutoff  wave  number! 

Here  we  have  the  second  big  advantage  of  the  parabolic  equation.  (Recall  that  the  first 
was  that  these  effects  are  of  low  order.) 

Thus,  the  parabolic  equation  takes  a  subtle  group  of  physical  effects  associated  with 
curvature,  slope  and  sub-wavelength  scales,  and  makes  them  accessible  by  bringing  them 
to  low  orders  (essentially  by  stripping  off  the  local  component  of  these  effects  and 
separating  it  out  from  associated  multiple  scatter  effects).  It  also  offers  a  serviceable 
workaround  to  the  renonnalization  problem. 


0.5  Final  thoughts 

As  an  interesting  side  note,  we  remark  that  in  stochastic  scattering  theory,  the  third-scale 
physics  will  involve  the  coherent  component  proportional  to  \  f2j(z)  and  the  incoherent 

component  proportional  to  ■  8zj .  The  contribution  to  the  scattering  of  the  incoherent 

wave  is  associated  with  the  curvature.  This  illustrates  why  most  of  the  attention  in  the 
rough  surface  scattering  community  (which  usually  considers  the  stochastic  incoherent 
field)  has  focused  on  curvature-induce  boundary  waves.  Furthennore,  as  we  have  seen  in 

the  main  body  of  the  paper,  the  tilt-induced  contribution  oc  (J^'j^z)  disappears  entirely 

when  there  is  no  density  jump  (Section  4.3),  and  it  is  a  very  modest  effect  when  there  is  a 
density  jump  (Subsection  5.3.4).  The  tilt-induced  effect  is  thus  insignificant  in  the 
stochastic  problem.  It  is  of  primary  interest  in  the  deterministic  problem. 

Finally,  let  us  note  two  interesting  phenomena  that  are  not  covered  by  the  formalism 
discussed  here.  Enhanced  backscatter  is  obviously  not  compatible  with  the  forward- 
propagating  parabolic  equation.  Whispering  gallery  waves  bear  a  superficial  similarity 
with  the  curvature-induced  boundary  waves  introduced  here,  primarily  because  they  are 
curvature-induced  boundary  waves  associated  with  concave  hard  surfaces.  However, 
they  are  not  a  small-scale  effect  at  all.  They  are  usually  understood  in  terms  of 
geometrical  acoustics,  where  rays  will  bunch  up  along  a  concave  surface.  Furthermore, 
looking  at  equation  (N.18),  we  see  that  this  type  of  curvature  induced  boundary  wave  has 
an  e  -folding  distance  of  4  times  the  radius  of  curvature,  while  whispering  gallery  waves 
are  confined  to  a  fraction  of  the  radius  of  curvature  of  the  surface  that  generates  them. 

Nevertheless,  the  parabolic  equation  complete  with  the  “vacuum  polarization”  set  of 
tenns  introduced  by  the  Foldy-Wouthuysen  transformation  holds  real  promise  for  casting 
light  on  a  heretofore  poorly  understood  class  of  physical  effects  associated  with 
curvature,  slope  and  sub-wavelength  scales. 
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P  Appendix:  The  acoustic  Lamb  shift  when  the  density 
and  compressibility  both  jump 

This  appendix  examines  the  stochastic  boundary  conditions  for  the  quasi-first  order 
theory  of  a  density/sound  speed  jump.  Section  P.l  provides  the  details  of  the  derivation 
of  the  boundary  conditions  (equation  (5.20)),  Section  P.2  a  discussion  of  the  meaning  of 
this  result,  and  Section  P.3  a  discussion  of  adjustments  to  the  stepping  procedure  peculiar 
to  this  quasi-first  order  theory. 


P.  1  The  derivation  of  the  boundary  conditions  for  the  stochastic 
problem 


The  boundary  condition  on  the  field  %  is  considered  in  Subsection  P.1.1  and  the 
boundary  condition  involving  the  first  derivative  c)% /c)z  is  considered  in  Subsection 
P.1.2.  Subsection  P.  1 .3  provides  a  lemma  used  in  Subsection  P.  1 .2  and  Subsection  P.  1 .4 
provides  a  lemma  used  in  the  derivation  of  both  boundary  conditions. 


Let  us  start  with  the  deterministic  boundary  conditions  (5.16)  valid  along  the  surface 
z  =  / .  These  boundary  conditions  are  written  here  in  slightly  different  form: 


Xi  z=/  Xu  z=/ 
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The  derivation  of  equation  (5.16)  depended  on  the  assumptions  described  by  Figure  5.1, 
and  these  must  continue  to  hold  here.  Specifically,  we  are  operating  in  2-dimensional 
x-z  space,  where  space  is  separated  into  two  regions  by  a  quasi-planar  rough  surface 
z  =  /  (x) .  The  positive  z-axis  points  from  Region  II  into  Region  /  ,  and  the  average 

plane  of  the  rough  surface  is  along  the  line  z  =  0 .  Now  we  add  one  new  condition: 
assume  that  /  defines  a  random  rough  surface  that  obeys  Gaussian  statistics.  As  always, 
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K  is  the  compressibility  is  and  p  is  the  density.  As  before,  also  assume  that  K f  >  Kn 
and  Pj  <  pn  ,  and  that  the  reference  values  are  chosen  to  be  the  larger  value.  The 
reference  wave  number  is  therefore  given  by  k0  =  0)^K tpu  ,  where  co  is  a  frequency. 

Now,  follow  the  stochastic  procedure  for  the  cusp  (Subsection  3.2.3)  and  the  sound  speed 
jump  (Section  4.3),  and  as  in  these  sections,  adapt  the  technique  originally  developed  by 
Kuperman65.  To  this  end,  we  will  project  the  boundary  conditions  at  z  =  /  down  to  the 
line  z  =  0 ,  break  the  wave  function  evaluated  at  z  =  0  into  coherent  and  incoherent  parts: 
%\  =  (z)(0)  +  $X  ■>  and  average  these  boundary  conditions. 


P.1.1  The  boundary  condition  for  the  field  x 


Thus,  we  begin  with  an  updated  version  of  equations  (3.27)  and  (4. 1 1). 
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Now  recast  the  second  boundary  condition  in  equation  (P.l): 
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Let  us  symmetrize  this  result.  This  time  recast  the  second  boundary  condition  in  equation 
(P.l)  this  way: 
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Add  (P.4)  and  (P.5)  and  divide  by  2  to  get 
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Now,  expanding  on  each  side  of  the  interface  independently  through  1st  order  in  / ,  we 
have 
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Similarly  through  0th  order  in  /  ,  we  also  have  p% 
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these  expansions  into  (P.7)  and  multiplying  by  -/  gives  us 
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Now  plugging  result  (P.8)  into  (P.3) 
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3rd  order 


The  o( / 2  j  term 
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Note  that  the  term  in  (P.8)  proportional  to  f28p  ■  (d2%/dz2 )  now  generates  two  new 

tenns  each  marked  by  a  factor  of  %  .  We  picked  up  the  factor  of  2  in  the  numerator  to 

cancel  the  overall  factor  of  !4  out  in  front  of  the  O  (  /2 )  term,  and  we  also  picked  up  a 

factor  of  2  in  the  denominator  since  we  busted  apart  the  average.  Also  in  the  O (f2) 

term,  we  have  dropped  the  designation  identifying  the  depth  coordinate,  since  the  term  is 
already  2nd  order,  and  we  will  be  taking  advantage  of  this  to  move  freely  between  z  =  / 

and  z  =  0 .  To  remind  ourselves  that  we  are  doing  this,  we  will  retain  the  O  (/3 )  error 

designation. 

Let  us  rework  the  O  (f2 )  term  in  equation  (P.9).  The  goal  is  to  symmetrize  this  term: 

i.e.,  get  this  term  into  a  fonn  involving  averages  at  the  interface  rather  than  values  on  one 
side  or  the  other.  To  begin,  note  that 

I  _  |  Pu  ~  Pi  _  Pu  .  ^  —  1  RlL  zjh.  —  £± 

Pi  Pi  Pi  '  Pn  Pn  Pi,  ’ 

Substitute  this  result  into  the  O  (f2 )  term  in  equation  (P.9): 
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Now,  to  get  this  into  a  form  involving  averages  on  the  interface,  we  have  to  continue  with 
the  following  manipulations: 

=  f_( P„  })2X,  Pi  d2Zn  V  f  f  Pu*2X,  Pid2Z,A  hQ(f3 

4  [p,  dz2  pu  dz 2  J  4  [P!  dz2  pu  dz2  ) 


=  f_( ( Pi,  {d2Z,  P,  })2X„ 

4  [Ip/  J  dz 2  Pu  ^2  y 


, /2 r Pu d2z,  d2Zj/  (  Pt' 

4  [  p,  dz2  dz2  ^  p„  J 


Reorganizing  these  terms  as  follows: 
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,  f2  [ d"Zi  P,  d2Xn  1  |  f2  r Pu  &X,  d2Zu  1  |  Q  /  r 3 \ 

4  dz2  Pu  dz 1  4  pj  dz2  dz2 


the  0(/2)  tenn  in  equation  (P.9)  becomes: 


f2  S  \  1  d2Z  1  ,  f2  r  d2Z/  P,  d2Zn  1  |  f2  [ A/ 

2  p  dz2  4  dz2  pn  dz 2  4  pI  dz2  dz2 

Use  Lemma  2  Use  Lemma  2 


+o(/sr 


(P.10) 


Now,  use  Lemma  2  in  Section  P.  1 .4  below.  Substitute  for  the  last  two  terms  in  equation 
(P.10)  using  equations  (P.34)  and  (P.35): 

TheO(/2)  tenn 

=  y'V>  +  y  (»*■>,  4Z )  +  y  ( A, A  Z) +  <->(/-  /, J') 

=  f-Sp  +f<{-^d.Az+0(f-/,f) 

=  y Sp  iLf  +<fAp-z  +  0(f-f,f). 


Now,  recall  equation  (P.6)  to  replace  p  ■  X  with  px  .  Then  use  the  definition  of  B  , 
equation  (P.2),  to  get: 

-f#|  +4 klf-AXx  +0(f-f,f).  (P.ll) 

p  dz  .  v  ’ 

This  is  the  symmetric  form  we  are  looking  for.  Since  we  are  already  at  2nd  order,  we 
have  used  our  freedom  to  evaluate  X  down  at  z  =  0 . 

Finally,  to  finish  our  evaluation  of  equation  (P.3),  we  need  to  obtain  Xi{f)~  Xu{f)  • 

Use  the  first  boundary  condition  in  equation  (P.l),  and  the  usual  Taylor  series  expansion 
to  get 
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Xi  z=/  Xu  z=/ 


=  ~f2ApX  | 
-Bf2 


z= 0 


Bf 


J_dz 

p  dz 


-Bf  ■  f 


1  d2X 

p  dz2 


Jz=0 


1 


(P.12) 


+  3rd  order. 


-=o 


p  dz2 

Thus,  take  equation  (P.9),  and  substitute  results  (P.l  1)  and  (P.12)  to  get 


Xj{0)~ Xu  (0)  =  - (f2  +  f  ■  f)  A pz[=o  - 

-Bf 

1  1 

^  i- 

1 _ 1 

:=0 

-B(kf+f2) 

\  1  d2z~ 

p  dz2  \ 

+  8k2f-L 

z= 0 

1 - 1 

^  i  - 

1 _ 1 

(P.13) 


Jz=0 


+4k2f2B 


±d\r 

p  dz2 


+  4  k2J2Apx 


+  3rd  order. 


z=0 


Jz=0 


Now,  again  follow  Kuperman65,  and  Sections  3.2.3  and  4.3.  In  equation  (P.13),  set 

X(0)  =  (x)i0  +  Bx 


On  either  side 


and  take  the  average.  The  result  is 

U,)U-U„)U=4 K{f)B 


i 

p  dz2 


+ 


4k;(f2)Ap{Z) 


z= 0 


z=0 


(P.14) 


+  4th  order. 


Diffuse  (Bragg)  scatter  terms 


Equation  (P.  14)  indeed  provides  the  first  boundary  condition  of  equation  (5.20).  Note 
that  this  result  once  again  mimics  the  tilted  interface  with  f2  — >  -4  k2  if2  ''j  .  The  error  is 

4th  order  in  the  stochastic  problem,  because  for  Gaussian  surfaces,  the  average  of  odd 
powers  is  always  zero.  Also  note  that  as  with  the  interface  where  the  sound  speed  (or 
compressibility)  alone  jumps,  the  “vacuum  polarization”  terms  do  not  survive  the 
averaging  process.  This  will  not  be  true  for  the  second  boundary  condition,  which  we 
derive  next. 


P.1.2  The  boundary  condition  on  the  derivative  dj/dz 

Now,  as  with  Sections  3.2.3  and  4.3,  let  us  go  on  obtain  the  boundary  condition  on  the 
first  derivative.  Modifying  equation  (3.30)  or  equivalently  (4.14),  we  have 
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1  dZ, 

1  dZn 

__  1  dZ, 

1  dZu 

P,  dz 

N 

CO 

o 

N 

N 

II 

O 

1 

02 

N 

Z=f 

P„  dz 

{  1  d2Z, 

1  #XU 

\ 

Pi  dz2 

z=f 

pn  dz2 

z=/y 

1  d3Z, 

1  d3Zn 

\ 

pi  dz3 

z=f 

Pi,  dz3 

Now  use  the  second  boundary  condition  in  (P.l)  to  replace  the  first  two  terms  and 
rearrange  the  O  ( / )  term  to  get: 


1  dXi 


P,  dz 

f2 


1  dX, 


II 


z=o  Pn  dz 

f  Pn  d2Zj 


z= 0 


2  Pn 


Pi  dz~ 


=  4fzLr  +  4f2^ 

.  f 


VXn 


*=/ 


dz' 


+  - 


z=/y 


2  Pi 


z=f 
2 


d2X, 


V 


dz2 


Pi  d2Zn 


z=f 


Pu  dz2 


Z=fJ 


+ 


'  1  d3z, 

1  d3Zn 

\ 

p,  dz3 

*=/ 

Pn  dz3 

z=/y 

+ 


o(/5)- 


(P.15) 


The  first  two  terms  in  (P.15)  can  be  Taylor-series  expanded  to  give 


Afzll+Ar-'f 


z=f 


dz 


=  4  z\M  + A{'f  ■  f) 

=  Afx\^  +  A(ff  +  f1) 


Af ** 
dz 


+  3  order 


z= 0 


dZ 

dz 


•  (P.16) 


+  3ld  order 


z=0 


To  evaluate  the  0(/)  term  in  equation  (P.15),  use  equations  (P.34)  and  (P.35)  from 
Lemma  2: 


The  O  (/)  tenn  =  -/ 


1 


}#2  BfX 
tKXAxLf-^z- 


r  i  d3x 

p 2  dz3  \ 

z=/y 

-/ 


f  1  4 

f(  k2  (/  A  y\ 

Bf/, 

r  i  d3x 

{i/i  2Lf 

2/ 

[p3  dz3  \ 

z=/y 

+ 


-Mlf- Ax\,.r+B(ff) 


1  d3Z 

p3  dz3 


+ 


*=/ 


Now,  expand  in  /  on  each  side  of  the  interface: 
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The  0(/)  term |  =  -8k; f  ■  A Z\:__0  - 8k; f2A^ 


:=0 


+*(/•/) 


1  d3Z 

p 3  dz3 


(P.17) 


Jz=0 


We  still  need  to  evaluate  the  last  tenn  in  equation  (P.15),  the  O  (/2 )  term.  We  have 


The  O  (/2 )  tenn 


f  1  d'Z, 

1  d3Zu 

\ 

2 

{. Pi  dz3 

Z=f 

Pu  dz3 

z=/y 

£_ 

2 


2  Pn 


Pu 


Pi  dz3 


z=f 


dz3 


+  - 


z=/  J 


2Pi 


#Zi 


dz3 


Pi  &Xn 


Z=f  rn 


Pn  dz3 


z=/  J 


Now  use  Lemma  1  in  the  form  of  equations  (P.25)  and  (P.26)  to  get 


The  0(/2 )  tenn 


_/2  1 

'  dsX„ 

\ 

2  2  Pn 

{  Pu  dz3 

z=/y 

|  f~  %h  Pu  A  dZi 


2  2  dz 


z=f 


+ 


f-  1 


2  2p, 


f  8k20Bd3Zl 

,  f2  8 k;pu  {dZ: 

v  Pi  dz3 

z=/y 

2  2  pj  dz 

+ 


z=f 


=  ~<fB 


±_d_y 

p2  dz3 


+  4  klf2A 


Jz=f 


dZi 


dz 


+ 


Pu  ^Z 


z=f  Hi 


P,  dz 


z=f 


+ 


o  (fhff). 


dz  J 


and  so  the  O  (/2 )  in  equation  (P.15)  is 


The  0(/2)  term 

-4k;f2B 

z- 0 

r  i  d3z] 
y  dz 3  _ 

+  3rd  order.  (P.18) 


Jz=0 


Since  equation  (P.18)  is  already  at  2nd  order,  we  have  used  our  freedom  to  slide  J  down 
to  z  =  0 . 

Now,  take  equations  (P.16),  (P.17)  and  (P.18),  and  substitute  into  equation  (P.15).  This 
gives  us 


307 


1  dx, 


Pi  dz 


1  dXn 


z=0  Hll 


Pu  dz 


a/zL+a(//+/2) 


dz 


z= 0 


d Z 


-A-%Kfx\z=o-*Kf  A^-\ 

+s(//)r 


1  d3z 


p3  dz3 


dz 

+  4  KfA 


2  f2  a^X 


:— 0 


dz 


z= 0 


-4 Kf  B 


1  d'x 

p1  dz3 


+  0(f  f,f  f)  +  y‘  order. 


and  combining  terms 


1  dZ, 


P,  dz 


1  dZu 


z= 0 


Pi,  dz 


z= 0 


-AfXL-A-UZfzlM  +  A{f-f  +  f>) 

-4klf-A'Z  -AklfB 


dZ 

dz 


:=0 


dz 


z=0 


1  d3z 


z=0 


_L  <0x 

p1  dz3 


p~  dz3 

+  0(f  f,f-/)  +  r  order. 


(P.19) 


r— 0 


Now,  once  again  as  in  Sections  3.2.3  and  4.3  and  earlier  in  this  calculation  (just  above 
equation  (P.14)),  follow  Kuperman65:  let  z( 0)  =  (z) I  +  $Z  and  average.  This  gives  us 


1  d(Zt 


P,  dz 


1  d(zL 


=0  rn 


Pu  dz 


z=0 


Diffuse  (Bragg)  scatter  terms 


-Akl{f)A 


3{Z> 


dz 


z=0 


1  d3(z) 

p1  dz3 


(P.20) 


1  d3(Z) 

p1  dz3 


+  4th  order. 


:=0 


Note  that  we  have  used 


(/•/)= o  ;  (/  /')= 0 


as  discussed  in  Section  P.  1 .4  just  below  equation  (P.28).  We  have  also  once  again  made 
use  of  the  property  of  Gaussian  functions  that  /yoddP°wer\  =  0  . 


We  can  also  rewrite  the  last  term  as 
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i  Hz) 

p2  dz 3 


1  d3(x) 

p2  dz 3 


This  is  the  final  answer  of  this  section.  Result  (P.20)  indeed  provides  the  second 
boundary  condition  of  equation  (5.20).  This  boundary  condition  is  further  discussed  in 
Section  P.2. 


P.1.3  Lemma  1:  An  expression  needed  to  evaluated  differences  of  3rd 
derivatives 


In  this  subsection,  it  will  be  shown  that 


P„  d:'Z, 
P,  dz3 


dz 3 


8 k2B  d  'zfl 


z=f 


Pn  dz3 


+  8k0  pnA 


dz, 


z=f 


dz 


+ 


z=f 


o(7,7 ) 


(P.21) 


Note  that  this  result  is  only  good  through  0th  order  in  the  derivatives  of  / .  That  is  all 
that  is  needed  in  the  context  in  which  it  will  be  used.  The  results  throughout  Appendix  P 
are  only  good  through  2nd  order  in  /  and  its  derivatives,  and  the  results  of  this  lemma 
will  be  multiplied  by  a  quantity  that  is  already  2nd  order  in  /  and  its  derivatives. 


is  continuous  to  O  (/) ,  which  implies  that  the  transverse  derivative  of  this  quantity 
is  also  continuous: 


A 

f  \  d_X^ 

ssf-V 

1  dZ, 

1  dz, , 

/P  dz. 

-  z=f 

Pi  dz 

z =/  P„  dz 

z=f  _ 

o+o(7). 


(P.22) 


Since  t  =  (x  +  f  -z}j +  f2  ,  equation  (P.22)  becomes 


d_ 

dx 


1  dZ 

p  dz 


0  + 


°(7,7) 


z=f 


Now,  use  the  fact  p  is  range-independent  in  the  half-spaces  to  commute  it  with  d/dx , 
and  then  also  commute  d/dx  and  d/dz .  Next,  use  the  parabolic  equation:  dx/dx  =  ill X , 
and  this  time  note  that  H  is  independent  of  z  in  the  half-spaces,  and  so  commutes  with 
d/dz  (recall  that  the  density  and  compressibility  are  assumed  constant  in  the  half-spaces). 
This  gives  us 


~H,dXl~ 

H  „  dZ  n 

p,  dz  _ 

z=f 

1 

N 

1 _ 

o+o(7, 7). 


(P.23) 


Now,  recall  that  with  our  conventions 
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and  so  substituting  into  (P.23),  we  have 


1  P„  d  Z, 

,  hr,  dz, 

1  1  d3Zu 

2k0  p)  dz3 

z=/  Pi  dz 

z=f 

2k0  pu  dz3 

I  k0 flu  d%u 

Pn  dz 


\z=f 


o+o  (fj). 


and  pulling  out  /(2koPi) 

Pa  Pi  VXn 

Pi  dz3  z=/  pu  dz 3 


+  2  k0  Yj 


2, ..  fyi 


dz 


+  2k0/Jn 


Pi  dZ n 


-=/ 


Pu  dz 


0  +  0  (fj). 


This  gives  us  the  basic  equation 

Pu  d  ’Z,  I  Pi  d  'Z„ 


Pi  dz 3 


Pn  dz 


(  Yi  Pu ) 


dZ, 


z=f 


-ApuA 


'  dz 


Z=f 


=  0  +  0  (fj).  (P.24) 


Now,  let  us  get  a  couple  of  useful  alternate  versions  of  (P.24).  First,  start  with 

=  0  +  0  (fj) 


d3Z, 

+z 

Pi  d3Zu 

8k:p,,A ?>Zl 

V  Pi  ) 

dz3 

Z=f 

Pu  dz3 

z=f 

0  #  11  ^ 

z=/ 


8p/ 


'Pi 


to  get 


d'Z, 


dz3 


Pi  d3Zn 


z=f 


Pn  dz 


Sp  d3Z, 


Z=f 


Pi  dz  lz=/ 
8 k;B  d3z, 


+  8  k0pnA 


dZ, 


dz 


+ 


Pi  dz3 


+  8k~puA 


Z=f 

dZ, 


Z=f 


dz 


o(f.f) 
+0  (fj). 


(P.25) 


Z=f 


Now,  go  back  to  the  basic  equation  (P.24),  and  rearrange  it  to  get 


Pu  d3Z, 


p,  dz3 


d3Zu 

+ 

f  P  d 

1  -rJ-\ 

d3Zu 

z=f 

K  Pu) 

dz3 

^k^PuA 


dZ, 


Z=f 


dz 


0+0  (fj). 


z=/ 


Sp/ 


'Pu 


and  so 


Pu  d3 X ,  ^ Zu 

Pi  dz'  z=/  ?}Z"  *=/ 


+  8*„V„^  +0(/,/) 

Pu  &  r/  &  ,-f 

+«iP.A%-  +0(7, /)■ 

Pu  oz  dz  z  , 


(P.26) 


The  second  equality  is  the  form  of  the  result  quoted  in  equation  (P.21).  Both  (P.25)  and 
(P.26)  are  used  during  the  calculation  pursued  in  Subsection  P.1.1. 

P.1.4  Lemma  2:  An  expression  needed  to  evaluated  differences  of 
second  derivatives 

In  this  section,  it  will  be  shown  that 

=  Sklp,Ax\__f-Bfp,  +0(7, 7, 72).  (P.27) 

/  IP  Z  \z=f 

Note  that  this  result  is  valid  to  1st  order  in  the  even  derivatives  of  / ,  but  only  to  0th  order 
in  its  odd  derivatives.  For  reasons  to  be  discussed  below,  the  accuracy  of  (P.27)  will 
prove  sufficient  to  eventually  produce  a  stochastic  theory  good  through  2nd  order  in  / 
and  its  derivatives.  The  results  throughout  Appendix  P  are  only  good  through  this  order. 

We  begin  with  the  first  boundary  condition  in  (P.l). 


This  implies  that 


on  the  interface.  Thus,  taking  the  tangential  derivative  t  ■  V  =  d/dx  +  f  ■  d/dz  +  O  (/2  j  of 
this  equation  valid  on  the  interface  and  noting  that  the  density  p  is  constant  in  the  half¬ 
spaces,  we  wind  up  with 

jj %!_  dZn  +Bf\-^ ]  +Bf\-—{^ 

dx  z=/  dx  z=/  '  |_/?  dz Jz=/  '  ypdxydzj 


=  0  +  O(/,/2).  (P.28) 

z=  f 


d2Zi  Pi  d2Zu 

^  w  Pu  &2  „ 
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Note  that  we  are  already  dropping  the  O  (/)  term  (proportional  to  d/dz  )  in  anticipation 

of  the  result  we  now  derive.  In  the  context  of  the  calculation  presented  above  in 
Appendix  P,  result  (P.28)  will  eventually  be  multiplied  by  a  quantity  proportional  to  / 

and  averaged.  The  average  of  /  multiplied  by  odd  derivatives  such  as  //  ■  f\  or 
(/ '  /)  is  zero.  To  see  this,  consider  the  autocorrelation  function 

{f  (xi)  f  (xi))  =  J  dk  S{k)ek(x'~Xl) . 


Since  /  is  real,  S (k)  must  be  symmetric:  S(k)  =  S (—k) .  Take  d/dxt ,  and  set 
xi  =  x2  =  x  ■  This  gives  us 


(/•/)=  \dk  S(k)k=  0 


(/  /)=  I  dk  S(k)e=  0. 

_oo  '  V  '  odd 
even 


Thus,  in  the  context  of  the  calculation  presented  in  Appendix  P,  an  error  in  equation 
(P.28)  of  1st  order  in  its  odd  derivative  will  still  be  good  enough  to  eventually  yield  a 
result  good  to  2nd  order  in  /  and  its  derivatives.  Since  this  order  is  consistent  with  the 

accuracy  elsewhere  in  this  appendix,  let  us  drop  the  remaining  O  (/)  tenn  right  away. 
Note  that  equation  (P.28)  good  through  1st  order  in  /  and  its  even  derivatives  is  also 
sufficient  to  eventually  produce  the  desired  2nd  order  accuracy. 


Now  recall  that  we  are  an  infinitesimal  distance  into  the  half-spaces,  and  so  all  quantities 
are  effectively  constant,  and  d/dx ,  d/dz  and  the  Hamiltonian  H  freely  commute.  Use 
the  parabolic  equation  to  replace  all  dx/dx  with  HI X  and  divide  by  i .  Equation  (P.28) 
now  gives  us 


\H  nXn\z=f 


+  Bf 


1 

(h,9*) 

1  | 

+ - 

(  rr  dX„  ' 

ri  TT 

_pl 

l  dz  J 

z=f  Pu 

l  dz  J 

z=f  _ 

(P.29) 


=  0  +  0  (fjj2) 


Next,  as  in  Section  P.1.3  use 


H,  = 


Pr  V 


_  HII 


Pi 


+ KYi 


V2 

Tf//  —  —  Pu 
2  kn 
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and  multiply  through  by  2k0 . 


Let  us  concentrate  on  the  O  (/)  tenn  first: 


Bf 

2 


Bf 


Pn  d3 Z , 


p)  dz3 


+ 


1  #Xn 


*=/ 


Pn 


+ 


2K 7 dz, 


Z=f 


P,  dz 


-  2  k0  pn 


z=f 


1  dZn 


Pn  dz 


z=/  J 


1  dZi 

Pi  3-  lz =/ 


2Pl 


Pn  <BX, 


P,  dz3 


+ 


Pi  d*Zn 


r-f  r  II 


Pn  dz3 


Z=f 


+ 


Bf 


2K  {7,-Pn) 


f  f_dzf 

ypdz  j 


+o(/) 


+  2nd  order. 


Consider  this  term  first 


Now  note  that 


Pn  d3Z, 


P,  dz3 


+ 


Pi  d3Zn 


Pn 


dz3 


_  Pi  Pn 

Pn  d3Zi 

z=f  \ 

PiPn 

Pi  dz3 

=  Plptt 

'  1  d3Z, 
Pi  dz3  z 

=  2  P:PU 

1  d3Z 

1 

NJ 

Qj 

N 

UJ 

1 _ 

+ 


Pi  d3Zn 


+  ■ 


Pn  ^3 

1  d 3Zn 


z=f 


Pii  ^z3 


Z=f 


Z=f 


Z=f 


and  we  have  for  the  O  (/)  term  in  equation  (P.29)  (times  2k0 ): 


2  h-Bf 
~  Bf  Pn 


±n, 

rdZ,^ 

+—H„ 

fdZ„2 

/>, 

l  dz  J 

Z=/  Pii 

l  dz  J 

z=f  _ 

±_lz_ 

p2  dz3 


+  Bf-k20{yI-pII ) 


z=f 


1  dZ 

p  dz 


+  ■ 


(P.30) 


The  other  term  in  (P.29)  becomes  (recall  we  are  multiplying  by  2 k0): 

2K  {Ha I  -  HnXu  )  =  —KZi  -  KZn  +  2tir,Z,  +  2KPuXu  • 

Pi 


(P.31) 


Substituting  (P.30)  and  (P.31)  into  (P.29)  (times  2 k0)  gives  us 
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o  =  —KXi  -  Kzn  +  2 kfaXi  +  2 klunXu 
Pi 


+BfPu 


±_lx_ 

p2  dz3 


+  Bf  ■  k{)  (  yl  Pu ) 


J_dx 

p  dz 


+o  (fjj1) 


or 


Pn  d2Zi 


Pi  & 2 


d2Zn 


z=f 


dz2 


2^o  Yi  X /  z_y  2k0  pn  xu  z=  f  Bf  p 


2=/ 


1  d3x 


-Bf  ■  Iffy  I- Pu) 


_fdx 

p  dz 


+ 


p1  dz3 

O  (fjj2). 


Z=f 


(P.32) 
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Now,  at  z  =  /  (the  interface),  we  have  from  the  first  boundary  condition  (P.l) 

■0(/=) 


r  _  y  I  Xi  Xll  _  y  B  f 
Xl  X  2  X  2 


1 

p  dz 


+  1 


y  _  y  Xl  Xu  _  y  Bf 

Xu  X  2  X+  2 


±dx 

p  dz 


+ 


o(,n 


and  so  at  z  =  /  (the  interface)  we  have 

-2k2  ( y,  Xl  |z=/  +  Pn  Xu  if )  =  -2k2  ( y,  +  pn )  xLf 

-/tfi-K+Pulff 
=  -2k2(yI  +  pII)xLf 

+tf{yI-Pn)Bf 


1  dz 

p  dz 


+ 


o(/2) 


2=/ 


(P.33) 


\_d_X_ 
p  dz 


+ 


o  (f2\ 


-I2 =/ 


Substituting  (P.33)  into  (P.32),  we  see  that  the  two  terms  proportional  to  d%/dz  cancel 
and  we  are  left  with 

Pu  ?)2Xi 


Pi  &2 


d  Zn 


Z=f 


dz2 


■~2k2  ijt+Pu) X\z=f~Bfp, 


z=f 


-ApuA 


1  dJZ 

p2  dz3 


+ 


o  (fjj1) 


z=f 


or 


P11  d2Zi 


Pi  dz2 


d2Xu 


dz2 


=  ^klp „  A  x\^f  ~  Bfp 


z=f 


J_^x 

p2  dz3 


+  0 (fjj2).  (P-34) 


z=f 
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Multiplying  through  by  pj pn  ,  we  reproduce  equation  (P.27): 

-Of/. /./').  (P.35) 

*=/ 

Both  equations  (P.34)  and  (P.35)  will  be  used  in  the  calculations  pursued  in  Sections 
P.1.1  and  P.1.2. 

P.2  Discussion  of  the  dj/dx  boundary  condition 

Subsection  P.2.1  examines  some  superficially  surprising  aspects  of  the  d^/dx  boundary 
condition  (equation  (P.20)),  and  Subsection  P.2.2  reinforce  the  veracity  of  the  result. 
Subsection  P.2.3  sums  up  the  discussion. 

P.2.1  Unaesthetic  aspects  of  the  dj/dx  boundary  condition  for  quasi- 
first  order  theory 

The  structure  of  equation  (P.20)  seems  to  suggest  that  the  d^/dz  boundary  condition  in 
the  detenninistic  result  (5.16)  (or  equivalently  (P.l))  is  missing  a  tenn  of  the  form 

Bf2\\  frl  •  (p-36) 

p  dz3  , 

L  r  J z-f 

In  the  stochastic  problem,  this  tenn  would  become 

i  ym 

p- 

and  when  introduced  as  an  additional  tenn  into  the  stochastic  boundary  condition  for 
d  (%) /dz  (equation  (P.20)),  it  would  provide  the  usual  cancellation  with  the  tenn 

proportional  to  (j"  -fj  .  If  present,  term  (P.36)  would  also  maintain  the  rule  that  the 

transverse  (i.e.,  traditional)  Lamb  shift  (i.e.,  the  part  generated  by  smearing  alone)  can  be 
obtained  by  substituting  -4A///2\  for  f 2  into  the  “vacuum  polarization”  terms  (i.e., 

downrange  Lamb  shift)  for  the  deterministic  tilted  interface  and  setting  f  =  0 .  This  rule 
is  essentially  equivalent  to  the  assertion  that  we  can  naively  take  the  procedure  for 
distorted  volume  given  in  Subsection  3.2.2,  and  apply  it  to  an  interface  z  =  / ,  where  / 
now  measures  the  distortion  of  a  range-independent  surface.  This  procedure  involves 
expanding  the  Hamiltonian  in  a  Taylor  series  to  get  a  stochastic  roughness-induced  term 
(see  equation  (3.16)  and  the  related  discussion): 


1  d  3z 


p 3  dz3 
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(P.37) 


(. f2)d2(H ) 

~2  dz' 


However,  expression  (P.37)  will  not  generate  the  term 


B 


1  53(J) 

p1  dz3 


that  appears  in  equation  (P.20) — for  exactly  the  same  reason  that  term  (P.36)  is  absent. 
Up  to  now  expression  (P.37)  has  worked  even  when  singularities  are  present,  but  this 
time  the  absence  of  the  term  (P.36)  has  led  to  an  exception  for  this  rule  of  thumb. 


Furthermore,  as  it  stands  equation  (P.20)  suffers  from  another  rather  unaesthetic  trait.  To 
demonstrate  that  the  two  terms 


_L^ 

p 1  dz 


and 


B(f-f) 


i  y{z) 

p2  dz3 


conserve  energy,  one  would  have  to  engage  in  formal  gymnastics:  Basically,  we  would 
need  to  add  extra  8m  -  and  VTS'  Vr  -tenns  to  the  Hamiltonian  to  bring  these  boundary 
conditions  into  our  stochastic  Hamiltonian  as  Hermitian  contact  potentials. 


P.2.2  The  “unaesthetic”  result  is  nevertheless  correct 

These  two  aesthetic  shortcomings  cause  some  concern  that  there  might  be  some  subtle 
error  in  the  reasoning  that  produced  equation  (P.20). 

The  latter  concern,  namely  that  the  argument  establishing  energy  conservation  for  the  two 

tenns  proportional  to  [l / p2  ■  d3  l^x)/dz3^  turns  out  to  be  unusually  subtle,  might 

incline  us  to  conclude  that  these  terms  were  somehow  erroneously  introduced  into 
equation  (P.20).  However,  there  is  simply  no  solid  reason  to  conclude  that  these  terms 
are  artifacts.  They  clearly  fall  out  of  the  calculation. 

Concerning  the  first  issue,  namely  the  surprising  absence  from  the  detenninistic  result  of 
the  term  (P.36),  it  is  relatively  unlikely  that  there  is  some  fundamental  flaw  in  our 
technique,  which  caused  us  to  accidentally  drop  this  “missing  term”.  The  only  unusual 
aspect  of  the  calculation  that  produced  the  deterministic  result  was  the  use  of  S  -function 
bifurcation  to  evaluate  products  of  distributions  (i.e.,  generalized  functions),  but  this 
technique  only  affects  tenns  that  are  2nd  order  in  the  density  jump  ( O  [  Sp1 ) )  or  of  higher 
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than  2nd  order  in  the  slope  f ,  while  the  missing  tenn  f2B^l/ p2  ■  33;jf/3z3  J  already 
shows  up  at  1st  order  in  ^  and  at  2nd  order  in  /kkkk. 

Thus,  we  would  be  well  advised  to  accept  equation  (P.20)  as  is,  and  not  allow  it  to  be 
trumped  by  some  preconceived  notion  of  what  it  should  be.  The  soundness  of  this 
decision  is  further  reinforced  by  the  observation  that,  in  a  sense,  the  missing  term  is 
already  hidden  in  our  current  result.  Examining  equation  (M.12)  and  the  argument 

leading  up  to  it,  we  see  that  —Xj% kl ,  the  tenn  that  generates  the  “vacuum  polarization” 

contribution  to  the  deterministic  result  (5.16)  (or  equivalently  (P.l)),  contains  a  hanging 
(or  invisible)  contribution: 


kkkk  It  is  easy  to  verify  that  we  are  not  overlooking  other  terms  that  are  1st  order  in  dp  ,  2nd  order  in  f  and 
its  derivatives,  might  involve  8  -function  bifurcation,  and  may  in  some  way  substitute  for  the  “missing 
term”.  For  example,  in  principle  8  -function  bifurcation  can  affect  terms  that  are  2nd  order  in  the  curvature 

(i.e.,  0(/2)).  However,  f~  does  not  mimic  the  behavior  of  f~  in  either  the  deterministic  or  the 
stochastic  problem,  so  O  {f2  )  terms  are  not  proper  substitutes  for  the  missing  O  [  f2  j  term.  At  first 

glance,  terms  of  O  ( f  •  f  Spj  are  more  promising  substitutes  for  the  O  ( f28p )  term,  since  such  terms 

can  mimic  the  each  other’s  behavior  in  the  stochastic  problem.  However,  it  is  hard  to  see  how  terms  in  the 
“vacuum  polarization”  series,  where  the  surface  function  is  pulled  out  by  taking  some  number  of  range 

derivatives  of  the  step  function  0  (z  —  could  possibly  lead  to  something  proportional  to 

0[f  •  f  Sp  j  .  Thus,  this  backdoor  channel  for  generating  something  in  the  deterministic  problem,  which 
later  functions  like  the  missing  term  once  we  get  to  the  stochastic  problem,  is  not  available  either.  Thus, 
the  problem  term  is  indeed  O  (  f'Sp^  ,  and  not  O  ( j~Sp )  or  even  O  [f  •  f  Sp )  .  Of  course  anything 

o(4o2)  comes  in  addition  to  the  already  problematic  O  (8p^  term.  At  order  O  ( f  Sp )  ,  8  -function 
bifurcation  has  no  relevance,  and  so  it  also  has  no  relevance  to  the  “surprising”  behavior  arising  from  the 
absence  of  the  O  {j^Sp^  term. 
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*=/ 

(note  that  as  always  p0  is  chosen  to  be  pn  ).  This  term  does  not  contribute  to  quasi- 1st 

order  theory,  because  it  will  not  contribute  to  a  single  or  double  infinitesimal  integration 
(see  Subsection  3.3.3  for  the  conversion  from  contact  potentials  to  boundary  conditions 
using  infinitesimal  integrations).  However,  in  the  0(A2,Xj  theory,  it  will  show  up  as  a 

boundary  condition  on  d^/dz  that  is  O  (f'2Sp'j ,  just  like  the  “missing”  term  we  have 
been  seeking.  Indeed,  it  is  easy  to  show  that  we  now  have  the  cancellation  that  has  been 
lacking  from  our  formalism1111.  The  boundary  condition  that  was  absent  from  O  ( A,  A ) 

(quasi- 1st  order)  theory  has  now  effectively  migrated  back  into  0(A2,Xj  theory.  The 

effect  that  was  related  to  its  absence  in  quasi- 1st  order  is  still  there,  but  it  has  shifted  up  to 
the  boundary  condition  on  the  third  derivative.  We  never  get  rid  of  it;  it  just  migrates  just 
like  any  of  our  effects  generated  by  contact  potentials.  The  effect  appears  to  be  real,  and 
it  constitutes  a  “downrange  Lamb  shift”  (i.e.,  “vacuum  polarization”)  contribution 
associated  with  a  density  jump. 


~XX¥-VTS'(z-f) 


16  k, 


_Ldj 

=  P°f~B  S"{z  f ) 

_Ldj 

p2  dz 

2k0 

p2  dz 

P.2.3  The  bottom  line 

Thus,  the  bottom  line  is  that  the  contribution  to  the  stochastic  boundary  condition  on  the 
first  derivative  (i.e.,  equation  (P.20))  that  is  proportional  to  (f-f)Sp  is  surprising  and  a 


1111  The  related  term  — ■  B  )y/(2A:0)  S'(z  —  f)  (l  //?' )  ,  which  contributed  to  the 

discontinuity  in  X  m  O  (  A, /T  j  theory,  now  in  O  (  theory  directly  modifies  the  boundary 

condition  on  the  second  derivative  d~j)f/dz2  .  When  we  use  the  usual  procedure  to  evaluate  the  stochastic 
boundary  condition  on  d  (x) /dz ,  the  usual  expansion  that  we  use  to  project  down  to  the  z  =  0  line 

generates  a  term  proportional  to  f  (^  X/  /dz-  —  ^  X n  /dz-  )  .  The  new  contribution  to  the  difference 

between  the  second  derivatives  will  in  turn  generate  a  new  term  proportional  to  f  ■  f  8p  ,  which  once 
averages  are  taken  provides  the  long-awaited  cancellation  with  the  now-present  term  proportional  to 

f2Sp. 
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little  suspicious,  but  for  now  we  have  to  accept  it  as  is,  and  keep  an  open  mind  as  future 
research  either  confirms  or  refutes  this  finding. 

When  performing  a  numerical  calculation  of  the  Lamb  shift  for  a  realistic  shallow  water 
environment  (Section  5.4),  we  will  employ  the  following  practical  strategy:  Since  it  is  the 
most  important  contribution,  and  by  far  the  easiest  to  evaluate,  we  will  concentrate  only 
on  the  component  of  the  acoustic  Lamb  shift  that  involves  the  wave  function  %  and  its  1st 
derivative,  and  drop  the  terms  that  involve  the  2nd  and  3ld  derivatives  (this  includes  the 
“vacuum  polarization”  tenn)  from  our  initial  assessment  of  the  significance  of  the 
acoustic  Lamb  shift.  For  future  use,  Section  P.3  discusses  the  stepping  algorithm  for  the 
full  quasi- 1st  order  stochastic  theory  that  includes  the  terms  that  do  involve  the  2nd  and  3rd 
derivatives  (including  the  “vacuum  polarization”  effect). 


P.3  A  downrange  stepping  procedure  of  the  quasi-1st  order 
result 


Fiere,  we  consider  stochastic  quasi-lst  order  (i.e.,  o(A,X}  )  theory  as  given  by  boundary 

conditions  (P.  14)  and  (P.20).  We  either  drop  the  loss  terms  associated  with  Bragg 
scattering  and  just  consider  the  downrange  propagation  of  the  coherent  field,  or  we 
evaluate  these  terms  by  writing  the  locally  generated  incoherent  field  Sx  as  some 
constant  times  the  incoherent  field  (see  a  brief  discussion  in  Subsection  3.2.2  just  below 
equation  (3.16)  and  also  reference  [64]  for  more  on  this  general  approach).  The 
stochastic  theory  is  good  to  2nd  order  in  the  surface  height  and  its  derivatives,  and  at  this 

order,  it  involves  the  mean  square  surface  height  (^f2'j  and  the  mean  square  slope  . 
As  throughout  this  paper,  the  surface  is  assumed  to  be  Gaussian. 

Only  the  Hamiltonian  away  from  the  interface  is  directly  involved  in  downrange 
stepping,  and  the  lead  transverse  derivative  in  this  Hamiltonian  comes  from  (X)  and  is 

2nd  order:  32(;if)/3z2  .  The  contribution  contributes  a  tenn  proportional  to 

in  the  boundary  conditions  at  the  interface.  This  and  some  other  terms  (associated  with 
downrange  smearing)  in  the  stochastic  boundary  conditions  (P.20)  are  proportional  to  the 
2nd  and  3rd-order  transverse  derivatives  of  (x)  ■  These  terms  create  tricky  issues  for  the 
stepping  algorithm,  and  in  this  section,  we  will  develop  a  stepping  algorithm  that 
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addresses  these  issues1™™1™1.  This  procedure  will  not  be  used  in  Section  5.4,  since  there 
the  terms  that  involve  the  2nd  and  3rd  derivatives  will  simply  be  dropped. 

Assume  that  we  have  just  stepped  on  side  /  (above  the  flat  interface),  and  we  have  (x,) 
just  above  the  interface  (and  of  course  also  (%)  at  all  grid  points  away  from  the 
interface).  Now,  use  the  0th  order  (in  /  and  its  derivatives)™11”  boundary  conditions  to 
evaluate  (x)  on  both  sides  (i.e.,  (Xu)  =  (Xi )  )•  Next  use  finite  differences  to  calculate 


mmmm  y|lc  deterministic  problem  would  proceed  in  a  similar  fashion.  There  is  some  added  complexity, 
since  the  interface  is  tilted  and  so  in  general  we  would  need  to  evaluate  H X  on  both  sides  of  the  interface. 
On  the  other  hand,  we  would  benefit  from  some  simplification,  since  the  deterministic  problem  does  not 
involve  3rd  derivatives.  It  is  not  worth  examining  the  deterministic  case  in  detail,  because  it  is  unlikely  that 

this  problem  will  be  solved  numerically  using  quasi-lst  order  theory.  As  shown  in  Section  6.1,  O  ) 
theory  introduces  effects  that  are  far  more  important  than  the  effects  introduced  by  the  O  (  j  term.  In  the 
0(/l~)  theory,  the  wave  function  X  *s  continuous  and  it  remains  so  when  the  O (X)  term  is  added. 


Furthermore,  in  the  O  )  theory,  the  lead  order  is  a  4th  derivative,  and  the  2nd  and  3rd  derivatives  from 
the  O^/ij  raise  no  special  issues.  Thus,  theory  is  perfectly  well  behaved,  and  there  is  no 

reason  to  worry  about  the  extra  complexities  that  emerge  in  O  (/l, A  )  theory.  On  the  other  hand,  in 


stochastic  theory,  the  O  (A)  component  already  leads  to  the  problems  discussed  here  in  Appendix  P.3,  and 
we  cannot  avoid  these  issues. 

nnnn  Since  these  terms  eventually  get  multiplied  by  terms  that  are  already  2nd  order  in  f  and  its  derivatives, 
0th  order  in  f  and  its  derivatives  is  all  we  need  at  this  stage.  In  stochastic  theory  for  a  Gaussian  surface, 
terms  that  are  0th  order  in  f  and  its  derivatives  come  with  an  error  term  that  is  2nd  order  in  f  and  its 
derivatives.  The  reason  is  that  for  a  Gaussian  surface,  the  expectation  values  of  odd  powers  in  the  surface 
function  (or  its  derivatives)  are  zero. 
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3  (Zu ) jdz  and  then  use  the  0th  order  (in  /  and  its  derivatives)""™  boundary  condition  on 
3(^)/3z  to  find  3(j/)/3z: 

3h;>_l 


3z 


~d(Zu) 

kPuj 

dz 

+  2  order. 


Now  use  finite  differences  to  obtain  32  )/3z  ,  and  a  stochastic  version  of  the  implicit 

boundary  condition  (P.34)  to  get  (to  0(/° ))"""" 


3 2  (Zl 


dz 2 


z= 0 


A/32(^/ 

A  3z2 


-8A:02/?7/^(j)|  +  2nd  order. 


z= 0 


(Recall  that  at  O  (/° ) ,  )  =  (/f77 )  =  {z)  •)  Since  we  now  have  second  derivatives  on 

both  sides,  we  can  take  the  average 


1  32(J) 

yO  3Z2 


-4 klA(z)  _o+2nd  order. 


We  can  also  use  finite  differences  to  get  33  (, ^ )/3z3 .  We  still  need  to  evaluate  the 
average 


1  33(J) 

/r  3z3 


To  do  so,  note  that  (to  O  (/° ) )"""" 


r  i  d3uy 

_  1 

1  d3(Zi)  1  33(j7/) 

p 2  3z3 

~2 

z=0 

p)  3  z3  pi  3 z3 

1 

Pii 

[p„V(xZ 

+  33fe) 

2p77 

Pi 

l  Pi  3z3  j 

dz3 

order. 


(P.38) 


Now  use  the  stochastic  version  of  the  implicit  boundary  condition  (P.26)  projected  down 
to  the  z  =  0  line.  Again,  this  is  valid  to  0th  order  in  /  """".  The  boundary  condition  is: 


Pii^Ui ) 

8k20Bd3(ZlI) 

+  8  klpnAd^Xl^ 

Pi  3z3 

z=0  ^ 

z=o  Pa  dz* 

z—0 

8  k~B 


P 


gX Zl 


n  J 


dz 3 


+  8  k0pnA 


:=0 


dU) 

3  z 


+  2  order. 


z=0 


Now,  substitute  it  into  (P.38): 
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1  V{x) 

p-  dz3 


Pn 


2  PI 


Pi 


lz=0 

6  K+ 

d\Xu) 

V  Pll  ) 

dz 3 

+  8  kip„A^L^Pl 


:=0 


Pi  ^ 


*{Zn 


/dz 
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d'jZi, 

dz3 
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2  Pi 


l+PlL 

1  d3(Zu) 

+  8k2pI/Ad(Z/1 ^ 

Pi 

v  Pll  y_ 

dz3 

Z-0 

z-0 


L 

known 

known 

Next,  plug  in  our  values  for 

[l  d2{z) 

p  dz2 

and 

z=0 

\  1  d3{X) 

p2  dz3 

z— 0 

+  2  order. 


into  boundary  condition  (P.14)  to  get  (zn)  to  C)(/2)  (we  already  know  (Xi )  to  this 
order).  Now  get  3  l^Xu  ) jdz  using  finite  differences,  and  then  use  boundary  condition 
(P.20)  to  get  3  [Zi)/dz  .  Finally,  use  finite  differences  to  get  32  ^Xi)/0z  and  so 
{H MZi)  •  Now,  we  can  step  and  start  the  procedure  all  over  again! 


Q  Appendix:  The  boundary  conditions  for  0(/i2)  theory 

In  this  appendix  (as  throughout  this  paper  when  the  focus  is  on  the  interface  itself),  we 
only  consider  1 -dimensional  interfaces  embedded  in  2-dimensional  spaces,  where  x  is 
the  range  and  z  is  the  depth  (as  always,  the  positive  z  -axis  points  up  into  region  /  ). 
The  (now)  scalar  transverse  derivative  Vr  and  3/3 z  are  used  interchangeably. 


Q.1  The  boundary  conditions 


In  this  section,  we  derive  equation  (6.15).  The  O  (/l2 )  equation  of  motion  is  given  by 
equation  (6.14)  rewritten  below: 


-M-k 

l  ~  A-0 
OX 


r  A  A2  ^ 


1-2/+  —  -  , 

v  K  j 


X- 


(Q.l) 
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In  this  section,  we  let  S  =k0(y- ju) ,  and  so 


A 

A2 


A,Vr(jr)Vr  m 

^  _ 1_ 


Po 


2k0 


4  k: 


+ 


A)Vr(i)Vr„  ,  „  PoVr(jr)Vr  ,  „ 


2L 


,  + 


2L 


-K 


(Q-2) 


The  basic  strategy  for  extracting  boundary  conditions  from  equation  (Q.l)  is  as  follows. 
Recalling  that  for  the  2-dimensional  problem,  Vr  =  d/dz ,  we  take  0,  1,2,  and  3  indefinite 

integrations  J  dz  ■  ■  •  followed  by  an  infinitesimal  integration  at  the  interface  z  =  /  (x) : 

f+e 

J  dz-- 

f-e 

Since  the  lead  order  term  involves  an  operator  of  the  form  Vr  (^)  V2T  (^)  Vr ,  we  will 
also  need  to  multiply  by  p  after  the  first  and  third  indefinite  integration.  Note  that  the 
lead  order  term  is  the  most  important  tenn  as  far  as  this  procedure  is  concerned.  The 
difference  between  the  order  of  this  tenn  and  the  total  number  of  integrations  (counting 
both  the  indefinite  integrations  and  the  definite  integrations  over  infinitesimal  intervals 
that  cross  the  interface)  will  determine  the  lead  order  of  the  derivatives  involved  in  the 
resultant  boundary  condition.  Also  note  that  this  term  will  never  be  collapsed  by  an 
explicit  internal  S  -function.  In  other  words,  the  final  definite  integration  will  always 
involve  the  integral  of  a  derivative,  and  so  it  will  always  generate  the  difference  between 
some  operator  involving  the  wave  function  evaluated  on  the  two  sides  of  the  interface. 

Let  us  proceed  and  see  how  all  this  works  in  practice. 

Performing  3  indefinite  integrations  followed  by  1  definite  infinitesimal  integration 
across  the  interface,  the  lead  order  term  Vr  (j/p)  V2  (}p)VTZ  gives  something 

proportional  to  Xi  ~  Xu  ■  To  get  a  contribution  from  elsewhere  in  equation  (Q.l)  that 
survives  this  battery  of  4  integrations,  we  would  need  a  tenn  that  is  capable  (in  principle 
at  least)  of  generating  a  8"'(z  -  f  (x)) .  By  inspection,  there  are  no  other  terms  in  A2 x , 

Ax  or  k0  (l  -  2y)  that  will  do  so0000.  Thus,  in  0(  A2 )  as  in  0(  A )  theory,  we  have  the 


0000  Folded  in  here  is  the  hidden  assumption  that  a  field  can  at  most  step  at  the  interface,  but  it  cannot 


contain  8  -functions  at  the  interface.  Such  field  behavior  is  itself  unphysical,  and  furthermore,  consistency 
with  the  wave  equation  would  force  the  field  to  pick  up  an  even  more  outrageously  unrealistic  infinite 

series  of  8”  -functions,  with  the  n  standing  for  derivatives  of  unbounded  order. 
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boundary  condition  Xi  =  Xu  ■  Similarly,  2  indefinite  integrations  followed  by  one 
definite  integration  yields  the  boundary  condition 

1  dZ,  _  1  din 

p!  dz  pn  cjz 


Now,  we  need  to  do  2  (an  indefinite  and  a  definite)  integrations  and  1  (definite) 
integration  to  get  boundary  conditions  involving  rfi^/Oz2  and  d^/dz3  respectively.  In 
addition  to  the  contribution  from  the  lead  order  term,  non-zero  contributions  will  also 
come  from  other  terms  if  they  in  principle  generate  S'(z  —  f)  and  S(z  —  f)  functions 


respectively.  Next,  we  must  examine  equations  (Q.  1)  and  (Q.2),  and  look  for  terms  that 
might  generate  such  distributions.  Given  the  boundary  conditions  we  have  just  obtained 
for  x  ar|d  y,MA  ,  it  follows  that  VT  (^)  VTZ  has  at  most  a  step  as  do  the  parameters  S 
and  y .  Thus,  we  see  that  only  one  tenn  in  addition  to  the  lead  order  term  is  capable  of 
generating  the  needed  S'  -  and  S  -  functions.  This  means  we  need  only  perfonn  our  two 
integrations  [  dz  ■  ■  •  on  the  reduced  equation 


Po 


4  kl 


+  - 


Po^  1 


2k, 


(^z) 


(Q.3) 


Including  the  other  terms  demanded  by  equations  (Q.l)  and  (Q.2)  would  only  introduce 
tenns  that  immediately  fall  out  upon  integration. 


f+e 


Thus,  we  begin  by  performing  a  single  definite  integration  J  dz---  on  equation  (Q.3). 
This  gives  us 


f-e 


Po  Y73  -v  Po  V73 


V3  r  -V  r  -V  r 

2  v  tAi  2  tAii  tAi  ^ II  v  tAii 

Pi  Pii  ,  Pi  Pii 


4k, 


■  + 


0 


2k, 


=  0. 


o 


Now,  multiply  through  by  2k0  and  set  p0  =  pu  .  Define  CX,  =  [Po/P\I=Pn/PI-  A1so 
note  that 

“/  =  ^0  {Ti~Pi)  ~  Kh 

“//  =  ^o{Yii~Pii)  ~  —^0  Pii 


since  p,  =  yu  =  0  for  our  standard  conventions.  This  leaves  us  with 
/y2V3  y  —  V3  y 

1  tX‘  tZu  +  K  aji  ■  Vr  Xi + hPn  ■  VtXii  =  0  . 

2  k0 

This  is  indeed  the  first  boundary  condition  in  equation  (6.15). 


(Q.5) 
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Now,  we  obtain  the  second  boundary  condition  in  equation  (6.15).  We  first  perfonn  a 
single  indefinite  integration  of  equation  (Q.3).  Once  again,  it  is  convenient  to  multiply 
through  by  2 k0  and  divide  by  p0 .  This  leaves  us  with 


1 

"  1 

/v-v2r 

-VTZ 

p 

IP  J 

2k0 


+  -VT{ZZ)  =  0. 
P 


f+£ 

Now,  multiply  through  by  p  and  perform  the  definite  integration  J  dz--  to  obtain 

f-e 


P0 


VL  Pi 


V2rZ 


1  7  2 

^  tX  II 

Pii 


2  kn 


-  +  (Zi-Zu)x  =  0. 


where  use  has  been  made  of  the  fact  that  the  field  is  continuous:  Xi  =  Xu  =  X  •  Now,  set 
A)  =  Pii  ■>  and  rearrange  terms  to  obtain 


(VrXi-VrXn)  (l-^)KZi 


2  L 


2L 


+  (Zi-Zn)x  =  0. 


Note  that 


piA 

=  Pu_ 

f 

1- 

Pi  ) 

V  Pi  J 

Pi 

V 

Pii  j 

=  2  a#, , 


and  recall  equation  (Q.4)  to  show  that  H/  - Zn  =k0(yI+  pn) .  This  leaves  us  with 

(KXi-KXn)  ,  2 ajrtXi 


2  L 


2  K 


-+h{r,+Pu)x  =  Q, 


(Q.6) 


which  is  slightly  rearranged  version  of  the  second  boundary  condition  in  equation  (6.15). 
We  have  thus  succeeded  in  reproducing  all  four  boundary  conditions  in  equation  (6.15). 


Q.2  Separating  out  the  terms  that  build  Bragg-scale  vorticity 

In  equation  (6.16),  the  terms  in  the  first  two  equalities  of  equation  (6.15)  (i.e.,  those 
directly  related  to  the  0(  A2 )  contribution)  are  rearranged.  In  this  appendix,  we 
demonstrate  that  the  boxed  terms  in  equation  (6.16)  can  indeed  be  identified  with  the 
emergence  of  Bragg-scale  vorticity.  Specifically,  we  demonstrate  that  contrary  to  what 
might  be  expected  based  on  the  1st  order  boundary  conditions  (see  equation  (Q.7) 
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below)pppp,  the  terms  in  the  boxes  modify  the  boundary  conditions  on  V"T%  (n  >  2  )  so 
that  the  behavior  of  the  auxiliary  field  X  does  not  mimic  the  behavior  of  the  pressure 
field  A  .  This  difference  reflects  the  need  to  construct  Bragg-scale  vorticity.  The 
assertions  in  this  paragraph  involve  some  subtlety;  so  let  us  take  a  minute  to  examine  the 
situation  more  closely  before  proceeding. 


Q.2.1  What  we  want  our  theory  to  include 

At  1st  order,  the  auxiliary  field  X  has  acquired  the  boundary  conditions  that  would  be 
expected  for  the  pressure  field  A  : 

~p^ ’tXi  —  ~p^ ’tZii  ^ 

%i  ~  Xu 


For  the  sake  of  this  discussion,  assume  that  the  problem  is  range  independent  and  that 
there  is  a  flat  horizontal  interface  where  the  density  jumps.  Note  the  continuity  condition 
Xi  =  Xu  >  and  recall  that  ^  is  a  carrier  of  the  downrange  flux  and  not  a  pressure  field. 

Since  the  problem  is  range  independent,  the  operator  H'2  now  commutes  with  d/dx  and 


with  the  Hamiltonian  H  (to  demonstrate 


=  0  ,  expand  X  in  eigenvectors: 


X  =  XC»X„  and  note  that  H~'Ax„  =  X„/y[K  by  definition,  where  Hx„  =  Enxn ). 


Therefore,  we  can  now  also  use  the  parabolic  equation  for  X  to  propagate  u  =  Il  '^X 
instead.  Among  other  things,  this  means  that  u  and  its  transverse  derivative  du/dx  are 
continuous  on  the  interface,  as  is  (1/ p)du/dz  .  Noting  that  in  the  half-spaces, 

X  =  ■  H'^A  with  a  =  p0/ p  locally  constant,  we  have  in  the  half-spaces 

u  = , Ja/k0  ■  A  =  Jp0/(pk0)  ■  A ,  and  so  we  get 


A  1  dA  1  dA 
—j= ,  —j= — ,  — y-  —  all  c  ontinuous 
yjp  fP  dx  p  2  dz 

c  AdA  S,  A  dA 

S - — - t-—  both  continuous 

p  ox  p  p~  dz 


(Q.8) 


By  contrast,  the  full  wave  boundary  conditions  imply  that 


pppp  By  the  way,  these  generally  survive  unchanged  in  the  sets  of  boundary  conditions  corresponding  to  2nd 
and  higher-order  theory.  (We  just  saw  this  for  2nd  order  theory.) 
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A, 

Sx 


dA  1  dA 

—  ,  —  —  all  continuous 
ox  p  dz 

A  dA 

- —  not  continuous:  S. 

p  dx 


A  dA 

- continuous 

p  dz 


(Q-9) 


The  PE  conserves  energy,  but  the  boundary  conditions,  and  the  horizontal  and  vertical 
fluxes  have  been  redistributed  relative  to  the  full  wave  result  (at  least  in  1st  order  theory; 
this  particular  result  will  generalize  to  higher  orders  since  it  turns  out  that  equation  (Q.7) 
remains  unchanged  in  higher-order  theory).  We  could  impose  continuity  on  A  and 
consequently  on  its  transverse  derivative  dA/dx  and  restore  Bragg-scale  vorticity 
(discontinuity  of  the  transverse  velocity  (1/ p)  (dA/dx) )  by  hand.  To  do  so,  we  add  to  the 
1st  order  Hamiltonian  H  =  k0-  2 yk0  +  A  an  extra  (energy  conserving)  term 


1 

2 K 


■—r,s'(z)z 

p 


y  _  Pu  Pi  _  Po  P 
2P„  2  A, 


with  p0  =  pn  \p  =  pj 


(Q.10) 


Note  that  we  are  following  the  conventions  and  definitions  outlined  below  equation 
(6.15).  p  and  now  even  %  turn  out  to  be  discontinuous  to  1st  order  in 

2r  =  (Po  -  p)/p0  =  ~AP/Po  =  {Pn  -PiMPu  =  5p!Pu  > 


and  we  should  in  principle  bifurcate  S'  (z)  to  interpret  this  tenn.  However,  to  0th  order 

the  two  variables  are  continuous,  and  so  to  1st  order  (since  we  are  multiplying  by  an 
additional  2/j )  we  can  choose  either  %,  or  Xu  anc'  pI  or  pn  (or  some  value  in 
between). 


Furthermore,  we  have 
A  = 


_  HQ 


v 

v  T  p  v  T 

2^o 

Po  V2tZ 
P  2£0 


%  +  terms  that  have  no  ^-functions 


+ 


fP^ 

P 


VTZ 


2  L 


+  • 


The  second  term  on  the  right  hand  side  of  the  second  line  contains  a  simple  8  -function 
that  will  not  survive  an  indefinite  integration  followed  by  an  infinitesimal  integration11™. 
Thus  we  drop  this  term  as  well  as  terms  that  contain  no  8  -functions.  Now,  we  multiply 


qqqq  Implicitly,  we  are  again  either  truncating  at  1st  order,  or  bifurcating  8  -functions  when  we  make  this 
statement. 
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through  by  2  k0  \pj pQ)  and  find  that  the  key  terms  for  the  discontinuous  part  of  the 
boundary  condition  on  x  are 

Kz-r^'{z)x+---, 


and  taking  the  infinitesimal  integration  across  the  interface 

Xi-Xn-YiX  =  0 


or 


Xi  = 


Xu 

Xu 


v  2Pn  j 

\ ^ 

v  2Pu 


+ 


higher  order 


or 


(Q- il) 


which  is  indeed  the  boundary  condition  we  would  expect  for  a  quantity  that  is  roughly  the 
square  root  of  the  component  of  the  energy  flux  that  is  transverse  to  the  interface. 

Note  that  we  want  a  term  such  as  defined  in  equation  (Q.10)  to  emerge  from  our  theory, 
and  not  just  to  be  imposed  by  hand.  The  reason  is  that  although  the  energy  is  conserved 
(the  term  is  Hermitian),  we  have  no  control  over  what  such  a  tenn  does  to  the  boundary 
condition  on  or  on  the  other  boundary  conditions  (i.e.,  conditions  on  V"^; n  >  2 

that  are  all  implicit  in  1st  order  theory).  Furthermore,  actual  jumps  in  the  wavefunction 
cause  problems  because  now  the  x  -derivative  of  x  will  spawn  a  8  -function,  and  this  in 
turn  would  lead  to  a  very  problematic  feedback  loop  in  our  boundary  conditions.  Thus, 
we  also  want  the  jump  in  X  1°  naturally  migrate  up  to  a  condition  on  the  higher-order 
derivatives.  This  will  happen  if  a  tenn  proportional  to  a  S'  -function,  such  as  is  given  in 
expression  (Q.10),  is  embedded  in  a  higher-order  theory.  Therefore,  we  now  examine 
our  0(  A2 )  theory  and  discover  how  a  tenn  of  the  form  (Q.10)  is  indeed  generated  by  our 
theory  (along  with  other  similar  terms). 


Q.2.2  For  comparison:  The  boundary  conditions  on  A 

Once  again,  consider  a  flat  horizontal  interface  (i.e.,  the  range-independent  problem).  As 
discussed  above  equation  (Q.8),  in  this  case  the  WKB  amplitude  (i.e.,  the  square  root  of 
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the  Hamiltonian)  associated  with  the  transformation  between  the  pressure  field  A  (which 
obeys  the  Helmholtz  equation)  and  the  auxiliary  field  X  (which  obeys  the  PE)  no  longer 
plays  a  significant  role  in  the  problem  since  it  cancels  at  the  endpoints.  In  fact,  in  this 
case  the  auxiliary  field  u  =  A/ Jp  ,  which  is  X  with  the  WKB  amplitude  peeled  off,  also 
obeys  the  parabolic  equation  (noting  that  1 j A Jp  is  expressed  as  a  function  of  the 
expansion  parameter  2 y ,  expanded  and  truncated  appropriately).  Also  note  that  this 
factor  of  l/  Jp  is  associated  with  Bragg-scale  vorticity.  Therefore,  we  have  an  instance 
where  only  the  presence  of  Bragg-scale  vorticity  distinguishes  the  solution  to  the 
Helmholtz  equation  and  the  solution  to  the  PE.  Therefore,  we  will  now  derive  the 
boundary  conditions  on  the  pressure  field  A  ,  and  compare  them  to  the  boundary 
conditions  on  x  •  To  facilitate  this  comparison,  the  boundary  conditions  on  X  will  then 
be  rewritten  as  the  sum  of  a  part  that  reproduces  the  boundary  conditions  on  A  ,  and  a 
part  that  generates  Bragg-scale  vorticity. 


Q.2.2.1  The  boundary  conditions  on  A  and  its  first  derivative  are  the 
same  as  those  on  x  and  its  first  derivative 


We  know  that  A  obeys  the  2-fluid  boundary  conditions 

4  =  4/ 

1  dAj  _  1  dAn 

p,  dz  pn  dz 


(Q.12) 


Since  these  boundary  conditions  also  apply  to  X  (see  equation  (Q.7)  and  note  that  it  turns 
out  that  these  conditions  apply  to  higher-order  PE’s  as  well),  Bragg-scale  vorticity  must 
emerge  from  the  higher-order  boundary  conditions.  To  observe  the  emergence  of  Bragg- 
scale  vorticity,  we  therefore  need  to  examine  the  boundary  conditions  on  higher-order 
derivatives  of  A  and  X  -  and  then  note  the  differences. 

For  the  solution  to  the  Helmholtz  equation  (with  its  lead  z  -derivative  2nd  order),  these 
higher-order  boundary  conditions  are  implicit,  and  we  will  have  to  derive  them  using  the 
explicit  boundary  conditions  and  the  Helmholtz  equation  itself. 


Q.2.2.2  The  boundary  condition  on  V3rA 

Let  us  proceed  to  examine  the  boundary  condition  on  V3rA  (still  considering  the  range- 
independent  problem).  Take  the  wave  equation  in  the  half-spaces  and  approach  the 
interface;  then  multiply  by  (l/p)VT: 
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x  (w2T  +  k])AI 


*  (V;+*;)4 


d  (  d  ) 

■ - — Aj 

dxydx 

d  (  d  A 

= - — An 

dx  l  dx 


(Q.13) 


Now,  subtract  the  two  equations.  Since  p1  and  pn  are  constants,  the  difference  between 
the  two  expressions  on  the  right  hand  side  of  the  “  =  ”  signs  gives: 

_ifiLLv.4~Lv  J) 

dx\dx  p,  pu 


Noting  that  here  V  T  =  d/dz  ,  equation  (Q.  12)  already  tells  us  that 


—  VrAj — —VTAn  =  0 


everywhere  on  the  interface.  Since  the  interface  is  horizontal,  d/dx  is  a  tangential 
derivative  relative  to  the  interface  (i.e.,  (l  -  hh)  ■  V  =  d/dx  ).  Thus, 

—  (constant  on  the  interface)  =  0 
dx 

d 2 

and  also  -^-(constant  on  the  interface)  =  0. 
dx 


So  the  right  hand  side  of  the  difference  of  equations  (Q.13)  is  zero.  This  leaves  us  with 


V/Ai  ,  k 


ViA, r  .  K 


+  ^-VrAr  +  lva 


Pi  Pi 


Pii  Pn 


(Q- 14) 


P,  Pu  P  X  Pi 


(kl-kl)- cL 

Pu 


=  0  .(Q.15) 


Note  the  cancellation.  Recall 


k1  =  n2kl  =  %  ■  C°y  and  c0 


and  so 
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kj-k l  =  [n]-\)k{ 


f  2  A 

Vc/  . 
f 


Ml 

kPhKj 
f 


-1 


1-A 

P 


a  J 


=  -2/A2 


k2  -  k1 

KII  Ko 


=  (n2n-\)k{ 


f  2 

V  cii 
f 


j 


\ 


P a  K a  _  ^ 

V  PiiKi  J 


xMn 


K 


i  J 


—  '2-fiIIk0 


(Q.16) 


Thus,  we  have 


Pi  Pii 

and  multiplying  through  by  pn  / (2 k0 ) 


vta,  via,,  .  ,  2  vta,  .  ,  2  vta ,,  . 

- T—^~  -  2/jkg  T—L  +  2pnkg  LJL  =  0 . 


Pi 


Pn 


GCjVjA,  -yrAn 


2  kn 


k0aIyIVTAI  +  k0junS7  TAn  =  0 . 


only  this  sign 
will  be  different  in 
the  Vj'Z  condition 


(Q.17) 


Note  that  the  sign  on  the  second  term  is  the  only  difference  between  this  boundary 
condition  on  V3A  and  the  corresponding  boundary  condition  on  V3 x  given  in  (Q.5). 


Q.2.2.3  The  boundary  condition  on  V2TA 


Now,  let  us  repeat  this  for  the  boundary  condition  on  V\A .  Now,  consider  the 
Helmholtz  equations 


(Vl+k])A,= 

I V  + 1  a )  I II  =- 


f 


dx 

d_ 

dx 


A 2  a 

dx  , 

dx  11 


and  subtract  the  second  equation  from  the  first.  Furthermore,  note  that  A,  =  Au  on  the 
horizontal  interface.  This  implies  that 


dA1 

II 

dx 

dx 

d  1 

(dAt 

=  0 

rk  ' 

^  dx 

dx  ) 

tangential 

derivative 
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This  leaves  us  with  (noting:  AI  =  An  =  A) 

V2TAI-y2TAII+(k2I-k2II)A 

y2TA,  -  yj.A/f  +  (k2  -  kl  )A-  (k2n  -kl  )A  =  0 


Substituting  the  results  of  equation  (Q.16)  and  dividing  by  2 k0  gives  us 


-  k0rIA  +  koJuI1A  =  0. 

this  sign  is  the  lowest 
order  part  of  the  difference 
between  the  Vj-A  and 
boundary  conditions 


VtAi 


2L 


(Q-18) 


Once  again,  note  that  the  sign  on  the  second  term  is  a  difference  between  this  boundary 
condition  on  V2d  and  the  corresponding  boundary  condition  on  y'1  X  given  in  equation 
(Q.6).  In  the  equation  for  V2^  there  is  also  an  additional  higher-order  tenn 
)/k0 .  This  term  too  is  associated  with  Bragg-scale  vorticity. 

Q.2.3  How  the  distinction  between  x  and  A  is  reflected  in  their 
boundary  conditions 

Now,  let  us  collect  the  boundary  conditions  involving  V2  x  and  y2r%  (as  given  in 
equation  (6.15);  and  equivalently  in  equations  (Q.5)  and  (Q.6)),  and  the  corresponding 
(implicit)  boundary  conditions  on y\A  and  y2TA  along  a  flat  (i.e.,  /  (x,y)  =  0  )  interface 
(equations  (Q.  17)  and  (Q.  18)).  Take  the  two  boundary  conditions  involving  X  >  and 
separate  out  those  parts  that  reproduce  the  corresponding  boundary  conditions  on  the 
pressure  field  A  ,  and  place  the  leftover  parts  in  boxes.  This  gives  us  equation  (6.16) 
rewritten  here  as  equation  (Q.  19): 

y  tXi  T%Il\—  k(PCiY^  TXl 

2k^\y tX I  ~  ^ tXii\  K  ( ft II  ~  Tl)Z 


+  k{]Hj^y  tX  n  +2k0cr/y/Vr^-j 


+2*0  hX  +  ^lX: 


=  0 


(Q.19) 


Reversing  the  basic  argument  that  led  to  equation  (Q.l  1),  it  is  not  difficult  to  show  that 
the  S'  -function  responsible  to  1st  order  for  the  extra  part  of  the  V2^  boundary  condition 
is  (-1/2 k0)«7/cT(z) .  This  is  the  same  as  in  equation  (Q.10).  In  obtaining  equation 


(Q.19)  from  this  S'  -function,  we  also  picked  up  an  extra  factor  -(2 k0)~  that  comes  from 
peeling  off  part  of  the  coefficient  in  front  of  the  new  leading  order  derivative 


-A2/2k„ 


-(l/2^0)!(l/2t„)Vr(->/)vq^)Vr 


+  •••. 
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It  is  not  difficult  to  show  how  (-l/2k0)  ay,8'  (z) emerges  from  the  two  cross-tenns  in 

V  ( V)  V 

(-1/2*.)  V?  (Kr)x  (Q-20) 

2ka 

and  part  of 

V  ('V')  V 

(-1/2  k0)^f~^(-koJu)Z.  (Q-21) 

2k0 


Let  us  examine  the  second  tenn.  With  our  conventions  that  =  K:  and  p0  =  pn  (useful 
if,  for  example,  medium  /  is  water  and  medium  II  is  mud),  we  find 


1 

r  ^ 

l 

K) 

i — 

2 

~2 

l  Kj) 

and  so  in  region  I ,  p  =  0  and  in  region  II , 


K,  -  K„  8K  A K 

p  =  — - -  = - = - =  P„ 

2  K,  2  K,  2  K, 


or 


PlI  ~  r. 


KnPn 


KlP 


iHu  y 


f  c2A 

V  cu  J 


Thus,  with  the  positive  z  -axis  pointing  into  region  /  : 


~KP  =  -y©(-z) 


k  f 

2  V 
k  f 

-f®H) 


v  c//y 


1-4— ^ 

c//  A/y 


1-n2 

1  "total 


V 


A 
y_ 

A/J 


where  «total  is  the  index  of  refraction  between  the  media  and  0(z)  is  the  Heaviside  step 
function.  Now  n2tal  =  1  -  2//total ,  where  //lolal  is  the  familiar  parameter  measuring  the 
sound  speed  jump  between  the  media,  and 

Pi  _  Pi  _  1  _  x  SP  | 

Pn  Pi+#P  1  +  f  Pi  Pii 

to  1st  order  (recall  our  conventions  pu-  p1-8p  =  -Ap  ).  Thus 
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r  A. 

1  total 


Pn 


1  2/hotai 


5p_ 

Pn 


to  1st  order.  This  gives  us 


-koJu  = 


1-2  M**~  — 


A@(-z)  1 

2  V  V 

k  (  ^ 

2  l  P 


Pn)) 


-k0®(-z) 


dp 

Aotal  +  ~ 

V  2/? 


A 


//y 


(Q-22) 


Note  that  equation  (Q.22)  also  implies  that 


8p 

Pi,  =Aotal+^-  =  Aotal  +  r/ 
2P„ 


(Q-23) 


to  1  order. 


We  also  have 


1 

7=- 

2 


\-P- 

v  Ay 


Pn  ~  P 


_  y  a 


2  Pn 


(Q.24) 


Thus,  /  =  0  in  region  II  and 

Pn~P,_Sp_  Ap_  = 

2  Pn  2pu  2pn  1 


in  region  /  .  Thus 


r=rMz). 


(Q.25) 


Combining  (Q.22)  and  (Q.25),  we  have 

~KP  +  K  7  =  — ^o®(— z)  Aotai  +  V/  [©(z)  -  0(~z)]  (Q-26) 

at  1st  order  for  our  standard  example  K0  =  KI  and  p0  =  pn  .  From  equation  (Q.23),  we 
have  pn  -  yI  =  //total ,  and  it  is  straightforward  to  show  that  the  first  term  in  equation 
(Q.26)  when  substituted  into  the  cross-term  (Q.21)  reproduces  the  unboxed  part  of  the 
V2t%  boundary  condition  in  equation  (Q.  19)  (i.e.,  the  part  the  reproduces  the  behavior  of 
the  pressure  wave  A  ,  and  so  does  not  remove  Bragg-scale  vorticity).  This  leaves  us  with 
the  term  k0y[  [©(z)  -  @(-z)]  .  From  equations  (Q.20)  and  (Q.21),  we  see  that  this  gets 
operated  on  by 
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(-1/2  K) 


Vr(a/Z)Vr 


2k, 


to  form 


k  y  [ a20(z)  g20(~2)" 

2 k0  p  2 k0  0  1  v  dz2  dz2  y 


Note  that  Vr  operating  on  1  j p  generates  a  simple  S  -function,  which  will  not  survive  a 


double  integration  across  an  infinitesimal  integral,  and  so  this  operation  has  been  dropped 
from  the  present  calculation.  Finally  use 


d20(z) 

dz2 


S'{z) 


S'(-z) 


d20(-z) 

dz2 


to  recover  the  term  (Q.10): 


2  k0  p 


R  Appendix:  The  electromagnetic  field 

The  appendix  supplies  the  details  behind  the  results  presented  in  Section  7.1.  Appendix 
R.l  derives  equation  (7.4),  the  state  space  equation  for  the  electromagnetic  field. 
Appendix  R.2  derives  equation  (7.6)  (with  definition  (7.7)),  an  approximate 
transformation  connecting  the  familiar  full-wave  magnetic  field  HT  with  the 

corresponding  auxiliary  field  dT  that  is  associated  with  downrange  propagation  by  the 
parabolic  equation.  Together,  these  two  results  provide  enough  infonnation  to  allow  the 
straightforward  mechanical  implementation  of  the  formalism. 

Appendix  R.2  also  derives  a  “minimal  version”  of  equation  (7.6)  applicable  when  the 
grazing  angle  and  the  changes  in  the  electric  pennittivity  and  magnetic  permeability  are 
all  small. 


R.l  The  state  space  equation  for  the  electromagnetic  field 

Section  R.  1 . 1  sets  up  the  needed  formalism  for  electromagnetic  fields,  and  identifies  the 
conventions  used  in  this  calculation.  Section  R.  1 .2  identifies  the  Foldy-Wouthuysen 
ansatz  appropriate  for  electromagnetic  fields.  Two  lemmas  needed  in  the  calculation  are 
derived  in  Subsection  R.1.3,  and  the  resulting  state  space  equation  is  derived  in  Section 
R.l. 4 
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R.1.1  The  basic  electromagnetic  formalism 


To  derive  the  state  space  equation  (7.4),  we  begin  with  Maxwell’s  equations  (in  MKS 
units): 


V-D  =  pf 


VB  =  0 
Vxl  =  -icopH 


(R.1) 


VxH  =  i  cos  E  +  J  f 


where  E  is  the  electric  field,  El  the  magnetic  field,  and  p  j  is  the  free  charge.  For  a 

linear  isotropic  medium,  the  electric  displacement  D  is  given  by  D  =  sE  ,  the  magnetic 
induction  B  by  B  =  pi  I  and  the  free  current  J f  by  J  f .  =  oE  .  The  electric  permittivity 

S  is  related  to  some  reference  value  S0  by  s  =  s0  +  8s ,  while  the  magnetic  penneability 
is  related  to  its  reference  value  p0  by  p  =  p0  +  Sp.  The  conductivity  <J  is  set  to  zero  in 
the  current  context1™)  although  a  nonzero  value  would  involve  only  modest  modifications 
to  the  current  formalism. 

Only  the  last  two  of  Maxwell’s  equations  (R.l)  will  be  used  here  in  Section  R.  1.  In 
addition,  the  second  Maxwell  equation  will  be  used  below  in  Section  R.2,  but  the  first 
Maxwell  equation  is  not  used  in  our  treatment.  Thus,  technically  speaking,  our  treatment 
makes  no  assumption  concerning  the  free  charge  pf .  Furthennore,  the  treatment  here  in 

Section  R.  1  does  not  assume  that  the  medium  is  necessarily  linear  and  isotropic,  but  the 
treatment  in  Section  R.2  will  assume  B  =  pH  ,  a  property  of  a  linear  isotropic  medium. 

The  energy  flux  is  given  by  the  Poynting  vector: 

S  =  E  xH  =  energy  flux  vector . 


Throughout  this  work,  the  physical  field  is  actually  the  real  part  of  a  complex  field. 
Under  these  circumstances,  the  time-averaged  Poynting  vector  becomes 


Save  =  ^Rc(/:  |  x  Re(/?)^>  =  —  Re  ExH*  =  average  energy  flux  vector  . 


(R.2) 


As  always,  we  consider  duct-like  (i.e.,  waveguide)  propagation,  where  the  x  -axis  defines 
the  downrange  direction.  A  subscript  T  on  a  vector  indicates  that  the  vector  is 
embedded  in  transverse  y-z  space. 


m‘  When  p j  =  0  and  J  f  =  0  ,  then  the  medium  is  a  dielectric. 


336 


R.1.2  The  Foldy-Wouthuysen  ansatz  for  an  electromagnetic  field 


Next,  let  the  discussion  at  the  beginning  of  Appendix  I  guide  us  in  constructing  an  ansatz 
that  will  lead  to  a  state  space  equation  that  is  a  suitable  starting  point  for  the  Foldy- 
Wouthuysen  (FW)  transfonnation.  Specifically,  start  with  the  basic  form 

(e\ 

<h  = 

UJ 


and  construct  9  and  X  from  the  original  field  in  such  a  way  that  <£>  77<f>  =  \0\  -  \x\  is 

proportional  to  the  (time-averaged)  downrange  flux  Sx  =  S.lvc  -x.  As  noted  in  Appendix 

I,  this  will  guarantee  that  the  total  energy  flux  <f>  77<f>  remains  conserved  at  all  values  of 
the  range,  and  this  in  turn  will  force  the  matrix  Hamiltonian  H  to  be  pseudo-Hermitian. 
The  Hamiltonian  in  the  state  space  equation  must  be  either  Hennitian  or  pseudo- 
Hermitian  for  the  equation  to  constitute  a  suitable  starting  point  for  the  Foldy- 
Wouthuysen  transformation  (see  the  second  to  last  paragraph  in  Subsection  C.2.3.2  or 
footnote  yyy  for  indications  of  why  this  is  so). 


With  these  considerations  in  mind,  the  suitable  ansatz  is 


r 9t > 

1 

f 

1 —  X 

£n  „  - 

(?  = 

Ht± 

1-5- -jc  xE 

Jt) 

~2 

{  ' 

iMo  J 

(R.3) 


where  HT  is  the  projection  in  (transverse)  y-z  space  of  the  magnetic  field  H .  The 
vector  <J>  is  underlined  with  a  tilde  to  indicate  that  it  now  has  4  rather  than  2  components. 


The  factor  of  ^£0/jU0  has  been  inserted  into  the  ansatz  so  that  the  units  work  out 
properly.  Below,  this  factor  will  sometimes  be  denoted  by  a  =  .  It  is  easy  to 

verify  that  <E>  ij<X  indeed  represents  the  uprange/downrange  flux: 


=  9*  ■9t-xt-Xt=  Re 


H  rp 


■  {xxE) 


=  Re 


H*  \xxE) 


Re 


■( ExH *) 


2  [uprange/downrange  flux]  ■ 


(R.4) 


There  is  one  small  new  twist:  this  time  it  is  9T  that  represents  the  downrange  flux  (and  so 
is  of  primary  interest  to  us),  and  XT  that  gives  us  the  uprange  flux.  (This  suggests  that  in 
this  one  case,  we  should  use  the  e‘M  convention  rather  than  the  usual  e~'M  convention.) 
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Also  note  that  we  made  use  of  the  basic  identity  that  the  vector  triple  product  can  be 


rotated  (for  arbitrary  vectors  A  ,  B  and  C): 


2(f?xC)  =  det 


'A' 

B 

A 


b\CxA)  =  C-(AxB). 


(R.5) 


Before  proceeding,  let  us  note  in  passing  that  we  could  just  as  easily  chosen  the  ansatz 

if  t-  A  ' 


<t>  = 


feT A 


tJ 


Et±  —  xxH 


V 


J 


and  the  whole  derivation  below  would  work  just  as  well. 
Next,  we  prove  two  lemmas  that  will  be  needed  later. 


R.1.3  Two  lemmas 


Lemma  A  (equation  (R.6)) 


H. 


OOaju 


\@t  Xt) 


(R.6) 


Proof:  Start  with  0T  —  %T  =  a  (x  x  E )  and  observe  that  since  there  is  no  x  component  to 
dT  and  Xt  ■>  '  (#r  ~  Xt )  =  ^ '  (&t  ~  Xt )  •  Thus,  we  have 

=  -a 


=  —a 

=  -a(x-  (- i(Oju)H ) 


V-(Ixxj 

Rotate  the 
triple  product 

f  \ 

x-  Vxl 

Use  Maxwell's 
V  third  equation  > 


=  icoajuH  x  QEB 
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Note  that  the  third  Maxwell  equation  in  (R.  1)  was  used.  We  also  made  use  of  the  basic 
vector  identity  (R.5),  now  being  careful  to  note  that  Jc  is  constantssss. 

Lemma  0  (equation  (R.7)) 


— -  VrHx -—-—Ht=Vt- VtHt  -  VT  -S7tH't  +  co2juHt 
c)x  £  ch  £  dx  £  £ 

As  always,  unless  explicitly  noted  otherwise,  repeated  indices  are  summed. 

Proof:  Begin  with  the  fourth  Maxwell  equation  in  (R.  1) 

Vx#  =  i(0£E  or  -Vx#  =  icoE  , 

£ 


take  the  curl,  and  use  the  third  Maxwell  equation  to  simplify: 


Vx 


-Vx// 

£ 


■■  icoVxE  =  (0  /j.El . 


Note  that  equation  (R.8)  is  the  wave  equation  for  the  magnetic  field  H  . 
Next,  use  the  basic  vector  identity 

Ax(BxC)  =  (A-C)b~(a  b)c  =  ABC  -[A-b)c 


(R.7) 


(R.8) 


(R.9) 


with  A  =  V  ;  B  =  ±V  ;  C  =  H  to  obtain8888 


1-  - 

.  1  -  .  , 

(-  1  ^ 

Vx 

—VxH 

=  V-VH'-\ 

V--V 

£  1 

l  e  J 

Substituting  into  (R.8),  we  now  have: 

V'-V/T 

£ 


(-  1  ^ 
V-V 
\  e  ) 


H  =  ar/uH . 


Taking  the  transverse  projection  (1  -  xx)  ■  (. . .) , 


ssss  In  general,  we  will  continue  to  use  standard  vector  identities  like  (R.5)  and  (R.9)  even  when  some  of  the 
vectors  are  operators,  as  long  as  we  are  sure  to  keep  all  vectors  being  operated  on  to  the  right  of  the 
operator.  To  maintain  this  convention,  it  will  at  times  prove  necessary  to  express  vector  products  in  terms 
of  sums  involving  the  components.  In  this  section,  and  always  throughout  this  paper  unless  explicitly  noted 
otherwise,  repeated  indices  are  summed. 
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or/nHT  =  V'  —VtH‘  - 
£ 


(-  1  -3 
V-V 

l  £  ) 


ht. 


and  then  explicitly  making  the  replacement  V  =  x3/3x  + Vr ,  we  have 


3  1 


3  1  3 


,  1 


co/uHt  =  ~ — Vr//x  -  - — — Ht  +  V‘T  —VtH't  -  VT  -—VtHt  , 


dx  £  dx  £  3x  £  £ 


and  rearranging  terms,  we  have  (R.7). 


R.1.4  The  state  space  equation 

Following  Sections  C.1.1  and  I,  recall  that  the  derivation  of  the  state  space  equation 
requires  that  we  develop  two  separate  intermediate  equations,  one  involving 

3  (0T  -  XT)/dx  and  being  essentially  a  rewrite  of  the  wave  equation,  and  the  second 

involving  3  [0r  +  Zt)/^x  ar|d  being  a  manipulation  of  the  definitions  in  the  ansatz. 

These  two  equations  will  be  derived  below,  the  former  being  equation  (R.  14)  and  the 
latter  (R.  16).  These  are  derived  in  Subsections  R.  1 .4. 1  and  R.  1 .4.2  respectively.  In 
Subsection  R.1.4. 3,  these  two  partial  differential  equations  are  combined  to  form  a  state 
space  equation,  which  is  then  streamlined  in  Subsection  R.  1.4.4. 


R.l.4.1  The  equation  for  3(<9r-jr)/3x 


To  obtain  the  equation  involving  3  [0T  -  %T  j  jdx  (equation  (R.  14)),  start  with 


dT—  Xt  =  a  x  x 


E 

Replace  using 
the  4th  Maxwell 
equation 


(R.10) 


=  — xx(Vx//) 
i(0£  v  ' 


Again  use  A x(f?xC)  =  A'BC'  - (2  ■  B^C  this  time  with  A  =  x  ;  B  =  V 

xx(VxH)  =  VH  H 
v  ’  dx 


C  =  H: 


(R.11) 
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and  substitute  into  (R.10)  to  get 


@T  Xt 


a 

iCO£ 


vthx 


dflr 

dx 


Multiplying  through  by  i  and  taking  d/dx ,  we  have 


dx 


Xt) 


dx  £  dx  £  dx 


Now,  substitute  Lemma  B  (equation  (R.7))  into  the  right  hand  side  of  (R.13)  to  get 


dx 


{0T  Xt) 


CO 


VT  —V‘tHt  - VkT—VTHk  +  o)2juHt 
£  £ 


or 


dx 


(@t  Xt) 


j  a 

CO 
a 
co 


1 


1 


V‘T—  W'tHjt  -V*  —  WjrHk  +co2juH 
£  £ 


%  -%8jk  -  V*  -  VJT  +  of  juSjk 
£  £ 


Hi 


■  %jkHk  . 


Substituting  dT  +  %T  ~  HT ,  we  have  successfully  recast  the  wave  equation  into  the 
desired  form: 


dx 


(ot-Xt)j  =  Zjk (oT + xT)k  or  i^(eT-xT)  =  z\dT+xT), 


with 


£jk  =—  VT  -%Sjk  -  V*  - VJT  +  CO2 juSik 
CO  £  £ 


sr  a 

G=  — 

-  1  -  7 

IV  •— Vr  +1  co  n~ 

V  ivr 

transpose 

CO 

£ 

£  \ 

(R.12) 


(R.13) 


(R.14) 


(R.15) 


341 


Note  that  these  operators  are  properly  Hermitiantttt.  This  gives  us  the  first  part  of  the  state 
space  equation.  The  wave  equation  for  H  is  embedded  in  result  (R.15).  It  entered  via 
equation  (R.8)  in  the  proof  of  Lemma  B  (equation  (R.7)). 


R.l.4.2  The  equation  for  d[GT+xT)/dx 


Next,  let  us  obtain  the  equation  involving 


d(9T  +  Xt) fix  (equation  (R.  16)).  This  is  the 


second  equation  we  will  need  to  obtain  the  state  space  equation,  and  it  emerges  as  we 
manipulate  the  definitions  in  the  ansatz  (R.3).  We  start  by  once  again  invoking  equation 
(R.12),  this  time  written  in  the  form 


coe 

a 


Xt  )  T  >^tHx 


.dHr 

1  dx 


Now  substitute  for  Hx  using  Lemma  A  (equation  (R.6))  and  again  substitute 
0T  +  Xt  =  • 


coe 

a 


—i 


(Da /u 


^  t  '  fir  Xt  ) 


d(0T  +  XT) 

dx 


or 


'• a  — ,+  - )  =  —  V 't  -  Vr  (er  -  zr )‘  + — ( eT -x, ) ' 

ox  COa  [A  'a 

d(0T  +  Xr)J 

dx 


—  VJT-VkT+  —  8ik 

COa  ju  a 


[0T  Xt) 


Thus,  the  equation  involving  d [0T  +  Xt)/^x  's 


i^^-  =  CJl{er-Xr)1  or  (R.16) 


tttt  In  this  case,  taking  the  Hermitian  conjugate  means  taking  the  transpose,  and  reversing  the  order  of  the 
differentiations.  In  principle,  we  should  also  perform  complex  conjugation,  but  that  is  not  an  issue  here. 
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where 


C]k=— V'-V*+— 8Jk  ■  C  =  — Vr-Vr  +  —  1.  (R.  17) 

(da  fl  a  (da  fl  a 

Once  again,  these  operators  are  Hennitiantm.  Equation  (R.  16)  gives  us  the  second  part  of 
the  state  space  equation. 


R.l.4.3  Combining  component  partial-differential  equations  to  form  a 
matrix  state  space  equation 


Together,  equations  (R.14)  and  (R.16)  are 


d  |  Oj  + 
dx 


%)=?*(% 

^l=r(er‘ 


(R.18) 


Now,  let  us  add  and  subtract  these  equations.  Adding  the  two  equations  and  dividing  by 
2  gives  us: 


.30'  {£*  +  (*)-,  ((“-(“) 

'17= — 5 — 0T+ — 2 — 


(R.19) 


while  subtracting  the  first  equation  in  (R.18)  from  the  second  and  dividing  by  2  gives  us 


M  (**-(*)*, 

dx  2  T  2  Zr' 


(R.20) 


Putting  these  two  equations  in  matrix  form  gives  us 


.  d  dj- 


dx 

Mt 


dx 


J  L 


(iJk-CJk)f  o  n  (tjk+Cjk)n 

2  l-l  0+  2  0 


0" 


(  qk \ 

\J y 

{ZrJ 


(R.21) 


Note  that  as  j  =  1,2 ,  we  indeed  pick  up  all  4  components  of  /  04>  /dx ,  although  perhaps 
not  in  the  usual  order.  Also  note  that  we  have  once  again  identified  the  matrices  TJ  and 

%  (as  first  defined  in  Section  3.1).  Temporarily,  there  is  a  tilde  on  top  of  £  to 
distinguish  it  from  the  matrix  %jk .  These  two  matrices  are  completely  different,  but  they 
only  appear  together  briefly,  since  %jk  will  shortly  disappear  from  the  scene. 
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To  get  a  Hamiltonian  in  standard  form  7i  =  O  +  6  +  kjj ,  we  need  to  pull  out  a  k,J]  (this 
time  multiplied  by  the  identity  matrix  S'k  )  from  the  term  in  (R.21)  that  is  proportional  to 


V- 


1  dx 

■  dir 

V  dx  ) 

_ 

f  nk  \ 

uT 

\tT) 


(R.22) 


Now,  let  us  define  new  variables  so  that  the  answer  (R.22)  looks  more  like  the  result  for 
the  2-fluid  acoustic  case.  Specifically,  define  AJk  such  that  is  the  coefficient  of  the 
matrix  £  and  so  the  primary  expansion  coefficient.  Also  define  yjk  such  that 
AJk  -2  kjjk  is  the  coefficient  of  //  in  the  even  operator  £  .  This  gives  us 

Pjk  _  r  jk  yjk  cjk 

Ajk  =z - and  YJk  =  -  — - b  — — .  (R.23) 

2  /  2k0  2 


Substituting  for  %jk  and  £Jk  using  equations  (R.15)  and  (R.17)  respectively,  we  have 

pjk  _  yjk 

AJk  =  ^ - 5— 


2 

=  —V'  —V'  Sjk  -  —  VA’  —  vj 

2  co  T  e  T  2  co  T  e  T 


1 


1 


f 


-Vyr-V"+- 
2  coa  //  2 


acOjU  - 


v 


coe 
a  ) 


8jk 


(R.24) 


and 


gjk  gjk 


2k Q  2 


— - — VJT-VkT 

2  k0(Oa  [A 


coe  Sjk  SJk 

- 1 - . 

a  2k0  2 


(R.25) 


Now,  let  us  streamline  equations  (R.24)  and  (R.25).  To  do  so,  we  obtain  four  useful 
identities. 


R.l.4.4  A  streamlined  version  of  the  state  space  equation 

Identities  1  and  2  are  closely  related.  First,  recall  that  a  =  yJe0/jU0  and 

Se  =  e- e0  ;  SjU  =  jU-^0. 


Therefore, 
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coe 


coe 


coe,  ^  +  co{e-e0)t 


8e 


0 

Se 


=  + — co^eju, 


O)/c0=k0 


-'O 


0)/  Cq  —k0 


IXr. 


f .  Se^ 

1  +  — 

V  £oJ 


and  similarly 


aO)jU=I^O)jU  =  O)^£jT0+  (O  (//-//0) 

'A.  — '  \Mo  ' 


W£oA)  _ 

M>  A> 


A 


azy//  =  Ay 


£// 


1  +  ^ 

v  ft; 


Identities  1  and  2  follow  immediately: 

coe  . 

acoju - =  k0 

a 


coe 


f  S/j  Se'' 
v  A 


i+* 

v  fo; 


Identity  #1 


Identity  #2 


Identities  3  and  4  are  trivial: 

a  _  l£0  1  _  ^*0 

zy  ]jMo  &  ojyjeji^  K 

_L=  f/4  1  _  A  _  A 

azy  ]J  e0  co  co^jeoJu0  k0 


Identity  #3 


Identity  #4 


Identities  1  through  4  are  then  substituted  into  equation  (R.24)  to  get: 


Xjk  =  — L  v'  —V'  Sjk _ —  vA'  —  vJ _ —  yJ  +  — 

2 k0  T  £  T  2k0  T  e  T  2k0  T  n  T  2 


Sji 

vft 


Se 


'o  y 


SJk  (R.26) 


Used  Identity  #3 


Used  Identity  #3 


Used  Identity  #4 


Used  Identity  #1 


and  into  equation  (R.25)  to  get: 
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R.2  The  transformation  tying  the  parabolic  equation  field  eT  to 
the  transverse  magnetic  induction  i'i r 

Appendix  R.2  derives  equation  (7.6)  (with  definition  (7.7)),  an  approximate 
transformation  connecting  the  familiar  full-wave  magnetic  field  HT  with  the 

corresponding  auxiliary  field  0T  that  is  associated  with  downrange  propagation  by  the 
parabolic  equation.  At  the  end  of  this  appendix,  we  also  derive  a  “minimal  version”  of 
equation  (7.6)  applicable  when  the  grazing  angle  is  very  modest. 

This  relatively  crude  look  at  the  “endpoint  transformation”  makes  a  number  of 
assumptions.  Here  in  Appendix  R.2,  propagation  must  be  in  the  downrange  direction 
only  (i.e.,  the  carrier  of  uprange  flux  %T  is  zero).  The  material  must  be  linear  and 

isotropic  so  that  B  =  jLlII ,  and  it  must  not  conduct  (i.e.,  the  free  current  J  f  must  be 

zero).  As  with  the  endpoint  transfonnation  for  a  variable-density  acoustic  field,  it  is 
further  assumed  that  the  environmental  parameters  (in  this  case  the  electric  permittivity 
and  magnetic  permeability)  are  slowly  varying  in  the  immediate  vicinity  of  the 

coordinates  where  the  transfonnation  between  HT  and  0r  is  to  be  applied. 

R.2.1  Deducing  the  basic  form  of  the  carrier  of  the  downrange  flux: 

0T  'JA  ■  ht 

The  derivation  of  equation  (7.6)  begins  with  the  basic  form  of  the  downrange  flux  given 
by  equation  (R.4)  with  the  uprange  flux  set  equal  to  zero  (i.e.,  %T  =0): 
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2  S, 


From  the  structure  of  the  Foldy-Wouthuysen  procedure,  we  know  that  a  linear 
transformation  between  HT  and  dT  exists,  and  therefore  some  dyadic  operator  which  we 


write  as  must  exist  such  that  \[2  ■  fl  ,  9,  .  Now  we  have 


2  S=  —  ■ 


9,. 


h1\4a-42\-ht=h*-a-ht. 


(R.29) 


Note  that 


implies  that  A  is  Hennitian. 


Now,  as  in  equation  (R.4)  (based  on  the  basic  equation  for  a  time-averaged  flux  (R.2)  and 
the  vector  identity  (R.5),  and  noting  that  xxE  is  in  the  transverse  direction),  we  have 


2SX  =  Re[x-£x#*]  =  Re[#*-jcx£]  =  Re[/7* -xxE  ■ 


dr 


Using  the  fourth  Maxwell  equation  in  (R.  1)  (with  the  free  current  J f  =  0  ),  we  get 


dr 


-  Re|~//!  -xxE~\=  —  Re 

'Mo  L  J  \Mo 


Ht  -xx 


f 

(  1  - 

- VxH 

v  icoe  j 


or  using  (R.ll) 


dr 


=  —Re 
Mo 


Hr 


1(0£ 


v,//.r 


c)IIT 

dx 


(R.30) 


Now,  we  need  to  recast  Hx  in  terms  of  HT .  To  do  so,  we  use  the  second  Maxwell 
equation  given  in  (R.l)  along  with  the  assumption  that  B  =  /uH  (i.e.,  a  linear  and 
isotropic  medium).  As  has  been  standard,  we  also  neglect  the  local  range  dependence  of 
[A  at  the  endpoints.  All  this  gives  us: 

V-(juH)  =  0 

M^t  =  -Vt{mHt)  (R.31) 
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//,  = 


dx 


M 


HI 


(R.32) 


Now, 


7  CO£ 


U-*' 


V 


dx 


lCO£ 


"ox 


M 


dx 


H]t 


or  using  dyadic  notation 


1 


lCO£ 


^tHx  ~ 


})flT 

dx 


lCO£ 


V7  1  ^  V7  ^ 

r3/>  r//  +  & 


dr 


J_ 


3x  'V 


HT 


Hr 


(R.33) 


Substituting  (R.33)  into  (R.30),  we  have 


e, 


2  £„ 

=  ^Re 

Wo 


Hr 


&»£■ 


i  —  -V  _ - _ -V  u 

1  -\  T  X  /  v  TH' 

OX 


<34" 


Hr 


(R.34) 


Now  note  by  setting  £  and  f.l  locally  equal  to  constants,  the  operator  between  H*T  and 
Ht  becomes  Hennitian,  and  we  can  drop  the  “real  part”  operator  to  get: 


0T 


|2 


(R.35) 


Or  taking  the  square  root  -  i.e.,  reading  off  yd  ■  HT  by  comparing  equations  (R.29)  and 


(R.35): 


Or 


(  £  ^ 
c0 

'A  1 

7-A-V 

1  V 

Wo  ) 

( 0)£)l/ '2 

dx 

T  .X  /  T 

l/dx 

Hr 


(R.36) 
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Note  that  if  we  did  not  want  to  make  the  assumption  that  e  and  jU  are  locally  constant, 
then  we  would  have  to  symmetrize  result  (R.34)uuuu.  To  keep  things  reasonably 
straightforward,  we  will  not  consider  this  case  here. 


R.2.2  Getting  rid  of  the  d/dx  operator  in  VI 

Next,  we  need  to  evaluate  the  operator  i  /x  acting  on  HT.  To  do  so,  we  again  neglect 
(local)  range  dependence  (at  the  endpoints),  and  exploit  the  fact  that  (as  with  the  variable 
density  acoustic  equation),  in  the  absence  of  range  dependence  we  can  directly  take  the 
square  root  of  the  wave  equation  for  HT .  The  parabolic  equation  derived  in  this  way  is 
different  from  the  one  obtained  using  the  Foldy-Wouthuysen  transformation.  As  in  the 
variable  density  acoustic  case,  the  parabolic  equation  for  HT  does  not  generalize  to  the 
range-independent  case. 


Let  us  begin  with  Lemma  B  of  Section  R.  1  (equation  (R.7)).  This  result  is  a  reworked 

form  of  the  wave  equation  for  H ,  and  it  is  the  wave  equation  for  IIT .  It  is  repeated  here 
for  in  slightly  modified  form: 


Ai  A#r/ 

dx  £  dx 


Vr--Vr 

£ 


-j  -j  /  2 

Ht - -VtHx  -  Vr-VrH!r  +  —Ht 

dx  £  £  £ 


0. 


Neglecting  (local)  range  dependence  of  £ ,  this  becomes 


1  d2HT 
£  dx 1 


r 


-  + 


\ 


Vr--Vr 

£ 


H  -iv 

T  T 

£  dx 


_  1  _  _  k  _ 

-ViT-VTH'T+—HT 
£  £ 


0. 


(R.37) 


Now,  substitute  (R.31)  into  (R.37)  to  get 


1  d2HkT 
£  dx2 


-  + 


1  -  > 

VT--VT 

HkT+-VkT 

1 

E-h 

b-i 

t> 

-1 

1 _ 

{  £  J 

£ 

LA  J 

_  1  _  _  p  _ 
-ViT-VkTHiT+—HkT=  0. 


Multiply  by  £  and  take  the  square  root  of  the  operators  acting  on  IIT : 


uuuu  One  would  have  to  write  the  real  part  as  a  sum  of  the  function  and  its  complex  conjugate,  and  then 
carefully  reverse  the  order  to  recover  the  proper  form  with  HT  on  the  left  and  l IT  on  the  right.  This 
procedure  would  involve  integration  by  parts,  and  it  would  essentially  run  the  analysis  in  Appendix  A  in 
reverse. 
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(R.38) 


.dH*  (*.  1  A 

1—^=  .  £ 


dx 


VT  ■  —  VT 

v  e  j 


Sjk  +  VkT-V]Tju- £V —  VkT  +  k2Sjk  ■  H]t 

H  £ 


Now,  to  make  this  result  compatible  with  (R.36),  also  neglect  the  (local)  dependence  of 
£  and  fi : 


,  i)Hk 
dx 


=  ^V2  +k2  Hk. 


Thus, 


k  k 

/i0  /t0 


(R.39) 


Now, 


k  =  co^e/li  =  (oJ[£0  +  &■)  (ju0  +  Sju) 

—  O)^£0/J,q 


f1+-] 

V  e0  J 

v  Mo) 

(R.40) 


Substituting  (R.40)  into  (R.39),  we  have 


.3  ,  V2  Se  Sju  Se  SjU  .  ,  21 

i  —  =  k  — T-  +  1  +  —  +  ^-  + - ~=k0  1  +  — 

ox  V  k0  £f)  fi0  £f)  //„  k0 


(R.41) 


where 


.  V2  k 
A  =  —JL  +  — 

2  K  2 


'  Se  Sn  +  Se  +  Sn 

V  £o  Mo  £o  Mo  J 


(R.42) 


Substituting  (R.41)  into  (R.36),  we  have 


0T 


(  £  i 

A 

l 

li  +  2i-v 

1  w 

\Mo  J 

1  +  ^)<v 

*0^ 

1  K  T  koy 

+ 

o*i£, 

1 _ 

■H, 


(R.43) 


Equations  (R.43)  and  (R.42)  recover  equation  (7.6)  and  definition  (7.7)  respectively. 


R.2.3  The  “minimal”  correction 

Finally,  to  get  a  “minimal”  correction,  assume  the  grazing  angle  is  very  small.  Then, 
equation  (R.43)  becomes 
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6, 


f  „  \ 

i 

" 

£q 

k  1 

\Mo  j 

yfeco 

L  v 

8e  Sju  +  Se  +  Sju 
£0  Mo  £0  Mo 


Ht 


f 


-o 


\Moj 

(  £ 

co 

\Mo  j 


yfeco 


CO 


yJ^oMo 


'  &V 

1  +  — 


-j=\£M]AHt  = 


V 

'JL 

\Mo  j 


'0  J 


i  +  SJL 

ft/ 

/ 

4 

Hr 


II, 


\£  J 


Dropping  irrelevant  constants  that  cancel  at  the  endpoints,  the  “minimal  correction”  is 
9T  =  (lile)/ HT+0(S72Tlkl). 


S  Appendix:  The  elastodynamic  field 

This  appendix  supplies  the  details  behind  the  results  presented  in  Section  7.2.  Appendix 
S.l  derives  the  state  space  equation  for  the  elastodynamic  field,  equation  (7.13)  (with 
definitions  (7.1 1)  and  (7.12)).  Appendix  S.2  derives  an  approximate  transformation 
connecting  the  familiar  full-wave  displacement  vector  u  with  the  corresponding  auxiliary 
field  x  that  is  associated  with  downrange  propagation  by  the  parabolic  equation.  The 
transformation  is  given  by  equation  (7.15)  (with  definitions  (7.16)  and  (7.17)).  This 
problem  is  more  difficult  than  the  electromagnetic  case,  and  so  equation  (7.15)  is  only 

good  to  first  order  in  Vr .  Together,  these  two  results  provide  enough  information  to 
allow  the  straightforward  mechanical  implementation  of  the  formalism. 


S.l  The  state  space  equation  for  the  elastodynamic  field 

Subsection  S.1.1  derives  the  state  space  equation  (7.13),  and  Subsection  S.l. 2  examines 
several  interesting  aspects  of  this  result. 


S.1.1  The  calculation 

To  begin  the  derivation  of  the  state  space  equation  for  an  elastodynamic  field  (7.13), 
Subsection  S.l.  1.1  outlines  the  basic  formalism  for  an  elastic  solid  and  states  the  Foldy- 
Wouthuysen  ansatz  for  this  problem.  Subsection  S.  1 . 1 .2  evaluates  the  three  constituent 
partial  differential  equations  of  the  matrix  state  space  equation  that  are  relatively  easy  to 
evaluate.  Subsection  S.  1 . 1 .3  evaluates  the  hard  one.  While  the  Hamiltonian  in  the  state 
space  equation  is  pseudo-Hermitian,  the  operators  in  the  constituent  partial  differential 
equations  should  be  Hennitian.  This  is  not  manifestly  apparent  for  some  of  the  terms  that 
appear,  but  Subsection  S.l.  1.4  shows  that  these  terms  indeed  properly  contribute  to 
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Hermiticity  for  the  overall  operator.  Subsection  S.  1 . 1 .5  combines  the  four  component 
partial  differential  equations  to  form  a  state  space  operator.  The  combined  operator 
should  be  manifestly  pseudo-Hennitian,  but  once  again  this  is  not  manifestly  apparent. 
Subsection  S.l.1.6  verifies  that  the  derived  Hamiltonian  in  the  state  space  equation  is 
indeed  pseudo-Hennitian.  Subsection  S.  1 . 1 .7  takes  the  state  space  equation  and  converts 
it  into  a  standard  fonn  that  resembles  the  state  space  equations  for  the  acoustic  and 
electromagnetic  fields.  Finally,  Subsection  S.  1.1.8  obtains  workable  expressions  for  the 
associated  partial  differential  operators  that  appeared  in  the  state  space  equation  derived 
in  Subsection  S.l.1.7. 


S.l.1.1  The  basic  formalism  and  the  ansatz  for  the  elastodynamic  field 


Once  again  as  in  Appendix  R.  1 ,  let  the  discussion  at  the  beginning  of  Appendix  I  guide 
us  in  constructing  an  ansatz  that  will  lead  to  a  state  space  equation  that  is  a  suitable 
starting  point  for  the  Foldy-Wouthuysen  (FW)  transformation.  Specifically,  start  with 
the  basic  fonn 

(0\ 

<F  = 

U; 


and  construct  0  and  X  from  the  original  field  in  such  a  way  that  $  rfb  -  \d\  -  \x\  is 

proportional  to  the  downrange  flux  Sx .  As  noted  in  Appendix  I,  this  is  enough  to 

guarantee  that  the  associated  state  space  equation  constitutes  a  suitable  starting  point  for 
the  Foldy-Wouthuysen  transfonnation. 


The  appropriate  ansatz  is 


<f> = 


X 

X 


—  (u  ±iax  ■  f ) 
2 V  ' 


(S.l) 


where  u  is  the  displacement  vector,  the  stress  energy  tensor  f  is 

fiJ  =  A(V-u)SIJ+jU  (VV  +  V  V ) ,  (S.2) 


and  a  is  a  constant  included  for  dimensional  reasons.  <£>  is  a  6-dimensional  vector. 


The  elastodynamic  wave  equation  is 

V  •  f  =  - poru  (S.3) 

and  the  associated  (time-averaged)  downrange  flux  is  (e.g.,  see  equation  2.3  on  p.  154 
from  reference  [154]): 

Sx  =  hm[u-f  •*].  (S.4) 
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Note  that 


e-0-Z-X  =  Ylm[U*'f'x]  =  ~aS*  06  5x >  (S.5) 

and  so  our  ansatz  indeed  generates  carriers  of  the  downrange  flux. 

Now  we  proceed  to  construct  the  state  space  equations.  Once  again,  following  Sections 
C.  1 . 1 , 1  and  R.  1 ,  recall  that  the  derivation  of  the  state  space  equation  requires  that  we 

develop  two  separate  intermediate  equations:  one  involving  d[d  - X ) fix  and  the  second 

involving  d  +  x ) fix  .  Treating  the  downrange  and  transverse  components  of  0  +  x 

and  6  -  X  separately,  this  becomes  four  equations.  We  develop  these  four  equations 
next. 


S.l.1.2  The  “easy”  part:  equations  for  d(0x- Xx)fai  d(0 x  +  Zx)fa  and 
d[0T+XT)fix 


Equation  #1  (equation  (S.7)) — for  d  (0.  -  Xx )  fa 


Take  the  divergence  of  [0  -  x)/{ia)  =  f  •  x  and  then  use  the  wave  equation  (S.3): 

v{0-z) 


la 


=  V  ■  f  ■  x  =  -par  ux  =-po)2{0x+xx), 

L-v-J 

0x+zx 


(S.6) 


and  so  /V  •  -  x )  =  pco2a  (0X  +  Xx )  or 

/3(\~Zx)  =  ■(&r-Zr)  +  pora  (0X  +  ) .  (S.7) 

We  will  save  the  equation  involving  d  fa  -  xT )  fix  for  last. 

Equations  #2  and  #3  (equations  (S.10)  and  (S.  11)) — for  d(0v  +  Xx)  fa  and 
dfa+Zi )fix 


Again  subtract  the  two  components  of  (S.  1),  but  this  time  use  definition  (S.2)  to 
substitute  for  f : 

0  -  X  =  idf  ■  x  =  ia 


x(y  -ufi + p 


du  - 

^-  +  Vw, 

ox 
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or 


?—X=f  .x  =  (A  +  2ju)^x  +  A(Vt-ut)x  +  jlNtux+jU^l.  (S.8) 

ia  dxK  ox 


First,  we  will  develop  the  equation  for  d  ( 0  +  Xx)/dx  (Equation  #2)  by  projecting 
equation  (S.8)  into  the  x  -direction: 


(e-z) 


0  -Y 

X  Zl  x 


(A  +  2/i)^  +  A  (Vr-wr) 

/-I  V  '  ' 


ia 


ia 


dx 


and  so 


du 


{A  +  2ju)^  =  -A(Vt-ut)  + 
ox  v  ’ 


■  dux 
dx 


A 


-7  - 


A  +  2/J 


(Vr  •  ) 


+ 


Q  -  Y 

x  Sv  x 

ia 

ox-zx 

a(A  +  2fa) 


(S.9) 


And  using  u  =  6  +  % ,  we  have  Equation  #2 


rd(?v  +  ZX) 

dx 


-7 


A 


A  +  2/J 


'(^r  +  Xr  ) 


+ 


0x-zx 

a  (A  +  2ju) 


(S. 10) 


Now,  let  us  develop  the  equation  for  d  [dT  +  ZT)  jdx  (Equation  #3)  by  taking  the 
transverse  projection  of  equation  (S.8)  (i.e.,  (l  -  xx)  ■...): 


0T-  xT  t— 7  du, 

T  --lNrux  +ji-  T 


ia 


.  du 


-l  =  -Htux  + 

dx 


dx 

@T  ~  Zt 
a/J 


Next,  using  uT  =  0T  +  XT  and  ux  =  0  +  %x  we  obtain  Equation  #3: 

diOr+Zr)  -  1  /-  \ 

i-X- — L=-ffT(e,+z,)+- -(t>T-Zr), 

dx  ajU 


(S.ll) 


S.l.1.3  The  “hard”  part:  the  equation  for  d[dT-xT)/dx 
Equation  #4  (equation  (S.20))  —  for  d  [dT  - Xt)/^x 
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The  remaining  equation  needed  to  derive  the  state  space  equation  is  the  most  difficult  to 
obtain. 

Recall  that  to  derive  Equation  #1  (equation  (S.7)),  we  used  the  x  -component  of  the  wave 
equation  (S.3).  To  develop  Equation  #4  (equation  (S.20)  below)  start  with  previously 
unused  transverse  components  of  the  wave  equation: 

V  ■  f  ■  (1  -  xx )  =  -pOTuT ,  (S.12) 

and  break  apart  the  gradient  operator: 

Tx  •  f  ■  (l-xx)l  =  —  Vr  •  f  ■  (1  -  xx)  -  pco2uT  .  (S.13) 

ox L  J 

Now,  set  this  result  aside,  and  note  that 

0T  —  Xt  =  *a[x-  ^  ■  (l-xx)]  .  (S.14) 

Operating  on  (S.14)  with  id/dx ,  we  get 

0t~Zt)  ?  r- 

i  1  ^  T)  =-a-^-\x-f-{l-xx)r  (S.15) 

ox  ox L  J 

Substitute  (S.13)  into  (S.15)  to  get 

3  [0T  —  XT 
dx 

We  are  not  quite  finished,  because  we  still  need  to  evaluate  Vr  ■  f  ■  (1  -  xx) .  Let  us  begin 
with  the  transverse  projection  of  the  definition  of  the  stress-energy  tensor,  equation  (S.2): 
f”=A (V-m)  S'j.  +Ju(ViTuJT  +  VJTuiT) 

'oT  o' 

0  1  0 
,0  0  1, 

=  S*+A(VT-uT)S$+fi(V,T4+VJTuiT)  . 

OX 

we  know 
this 


) 


=  aVT  ■?■(!-  xx)  +  pco2au7 


(S.16) 


Now,  substitute  for  du  jdx  using  equation  (S.9)  (divided  by  i ): 

f  n  \ 


fj  =  A 


A 


V 


A  +  2// 


^  Vr  •  uT )  +  ■ 


ai 


gv  ~%x 
(A  +  2p) 


5%  +  A  ( Vr  ■  uT )  S. 


+//  ^  Vj.w.j  +  ) 


(S. 17) 


Also  note  that 


355 


A2  +^_  X  +  A(A  +  2 ju)  _  2 juA 
A  +  2/j  A  +  2  ju  A  +  2  ju  A  +  2/u 


so  that  (S.17)  becomes 


Tl] 

iT 


2A/J  (VT-uT)S?+jU (Vj.ii'  +  V'm \ )  +  ^  $ 


A  +  2ju 


at 


(A  +  2ju) 


=  Vr  ■(«r  +  fc)'?+^(''i(«;  +  ^)  +  ^(»;+/,))  +  rilt+2/jjS'  (S- 1 8) 


2Ap  S?VkT  +  JU  [VjSj  +  VJTS'k )  (dT+Zr  V  +  S't 
A  +  2/u  v  ’  v  7 


/l 


ai(A  +  2ju) 


(%-&)• 


Thus, 


VT  ■?  ■  (l-xx)  =  Vr  ■  fT 


=  VJ 

v  T 


tg  +  -  \k 

A  +  2JU  r  Xt’ 


+ 


+%M%S?  (0T  +  Xt  f  +  VJ//V7  (dT  +  xT  f 

/  N 

/l 


+v; 


ai 


i ( A  +  2  /J ) 


(^-Zv)  , 


(S.19) 


and  substituting  into  (S.16)  gives  us 


dx 


+V' 


77  J^-^T  +  Kf^TSTk  +  KP^T 

r  A  ' 


{pr  +  Xt) 


ai{A  +  2n) 


{X~Zx)  + pa¬ 


ct  u 


(§t+Zt)J 


(S.20) 


Equation  (S.20)  is  Equation  #4. 


S.l.1.4  A  note  on  Hermiticity 


Note  the  second  to  the  last  terms  in  equation  (S.20)  and  in  (S.  10).  Individually,  it  looks 
like  they  will  violate  Hermiticity,  but  taken  together  they  give  a  structure  of  the  basic 
form 


'  0  iVTf 
jf^T  0  , 


(S.21) 


If  we  take  the  transpose  and  the  complex  conjugate  of  this  matrix,  we  wind  up  with 
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'  0  -ifv.A 

K-iVTf  0  J  ’ 

and  then  integrating  by  parts  we  get  back  to  (S.21).  Thus,  the  matrices  of  the  form  (S.21) 
are  perfectly  good  Hermitian  matrices.  It  is  also  worth  noting  that  one  finds  a  similar  set 
of  operators  that  initially  seem  to  violate  Henniticity  in  equations  4,  5  and  6  of  reference 
[155], 


S.l.1.5  Combining  the  component  partial  differential  equations  to 
form  the  state  space  equation 


Combining  the  four  equations  (S.7),  (S.10),  (S.l  1)  and  (S.20),  we  have 


^  [@T  Xt  ) 
dx 


=  a 


Vi ~^z l- vkT(eT+ xT )k  +  + vkrJuvjr  (0T+zT) 


-njt 


A  +  2// 

r  [A  +  2/j) 


(0x-Xx)  +  pora^+Xr)1 


di"d\  Xx>i  =  -iV-r  •  (dT  -  Xt  )  +  P°)2a  (#  +  Xx ) 


dx 

d(9T  +  XT) 

dx 


=  -iVT{dx+xx)  +—(dT-XT) 

' — v — '  aju 


(S.22) 


:~d{dx+Xx)  _  ■ 
dx 


(  A  ' 

^  A  +  2jU  j 


Vr-(^+Zr)+a(/l  +  2//)»,  X, 


Note  that  a  vector  with  a  subscript  T  is  a  2-dimensional  vector  in  the  (transverse)  y-z 
plane.  An  ordinary  vector  without  the  subscript  also  adds  a  third  dimension:  the 
downrange  direction  x ,  and  so  the  subscript  x  denotes  a  component  of  the  vector  in  the 
downrange  direction. 

Now,  the  indices  ye  {1,2}  denote  y  and  z  components,  while  the  indices  j  =  0  and 
k  =  0  denotes  an  x  -component.  Then,  with  the  following  definitions: 
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(S.23) 


bJk=, 


VJr  U/I  V*  +  VTpV‘T  S.f  +  V* //V 


/l  +  2  p 


rp  JL&/  T  Ji  i/  'J' 

gJk-gi«gk0 


:Jk  =  -Nl 


r  A  A 


(A  +  2  p) 


eJk  =  -iSJ0Vk 


Sk0  ;  dJk  =  pco2  a  8f 

gft-gJOtfO 

;  fjk  =Sj0Sk0pco2a 


the  first  two  equations  in  (S.22)  combine  to  form 

d(d-z) 


dx 


(bjk  +  dJk  +  fjk  )(o+z)  +  (cjk  +  eik  )(d~z) 


(S.24) 


Note  that  djk  +  fJk  =  pco2adjk . 


Similarly,  with  the  definitions 
gjk  =  -iVJTSk0 


;  hJk  =  —  5:? 
ap 


pjk  =  -i 


A 


A  +  2  p 


VkTSj0  ;  qjk 


gikSi«gka 

1 


'  i^A  +  2p) 


Sj0S 


■k  0 


(S.25) 


the  third  and  fourth  equations  in  (S.22)  combine  to  form 

=  (g/»  +  y*  )(*  +  +  +  q* )  (S.26) 


With  the  definitions  (suppressing  the  indices  j,  k  ) 

A=b+d+f  ;  B=c+e 
C=g+p  ;  D=h+q  , 

equations  (S.24)  and  (S.26)  become 

=  A'ik  + +  “  *)* 

/  =  c/,£  (0 + ^)*  +  DJk  (e  - 


(S.27) 


(S.28) 


Below,  it  will  be  understood  that  A,B,C  and  D  are  3x3  matrices,  and  that  the  vectors 
9  and  Z  can  be  understood  as  3x1  column  matrices.  Then,  (S.28)  becomes 
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(S.29) 


lS-^jl  =  A(0  +  z)  +  B{e-z) 

id^A  =  c(e+xhD(e-x) 


To  get  the  state  space  equation,  we  will  add  and  subtract  the  equations  in  (S.29).  Adding 
and  dividing  by  2,  we  get 

i^-  =  UA  +  B  +  C  +  D)d-UA-B  +  C-D)x. 

ox  2  2 


Subtracting  the  first  from  the  second  and  dividing  by  two,  we  get 

i-  =  -(-A-B  +  C  +  D)0--{-A  +  B  +  C-D)% 
dx  2  2 


or 


fd0x 


dx 

_  1 

'  A  +  B  +  C  +  D  A-B  +  C-D ' 

'o' 

k-A-B  +  C  +  D  - A  +  B  +  C-D , 

ydxy 


Thus, 


,(KI> 

1  a7 


Standard  terms  just  like  in  Acoustics  and  E+M 


+  \(A-D) 


V 


0  1 

-1  0 


i(c+s) 


v 

(1  0 ^ 
0  1 


A(C-B) 


i 

fo  o 


1  0 


New  terms  not  previously  encountered 


<t>  =  7f<J> 


(S.30) 


S.l.1.6  Verifying  pseudo-Hermiticity 

Note  that  in  the  matrix  Hamiltonian  7f  we  now  get  new  terms  of  a  sort  not  previously 
encountered  in  the  acoustic  and  electromagnetic  problems.  Before  proceeding,  let  us  take 
a  closer  look  at  these  terms.  The  matrix  operators  Tj^  and  1  have  not  appeared  before  in 
our  state  space  equation,  but  the  most  interesting  new  feature  lies  in  the  nature  of  the 
coefficients  of  these  matrices.  For  example,  expressing  C  +  B  in  terms  of  the  definitions 
in  (S.27)  and  rearranging  terms  a  little,  we  have 
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(S.31) 


and  let 


C  +  B  =  (c  +  p)  +  (e  +  g) 

A  X  ' 

( X+2]i )y 


-Njt 


dk0+-i[  A  Ivy 

^zl  +  2//  J 


+  (-iSj0Vkr-iVJrSk(>) 


a  =  (C  +  B) 


-Njt 


kl  0 

lo  b 

X 


(X  +  2  ju) 


?k  0 


X 


A  +  'l/j 


vi^y0 


+  (-i^°V*  -NJT8k0) 


f\  0^ 
v0  \j 


(S.32) 


Let  us  verify  that  the  tenn  (S.32)  is  pseudo-Hermitian.  Recall  that  some  operator  a  is 
defined  to  be  pseudo-Hermiticitian  if  [m])  =  arj  or  equivalently  i]a'  =  ar/  .  Therefore, 
to  verify  pseudo-Henniticity,  we  need  to  find  a'  and  commute  it  with  ij . 


To  find  a}  ,  we  need  to: 

1.  Take  the  Hermitian  conjugate  of  the  differential  operators.  This  means  integrate 
by  parts,  and  take  the  complex  conjugate.  This  amounts  to  VTf  — >  -/Vr  for  a 
real  function  /  and  i  — >  -i . 

2.  Exchange  indices:  j— >kandk— >y.  This  exchanges  rows  and  columns  in  the 
3x3  matrices. 

3.  Take  the  Hermitian  conjugate  of  the  2x2  matrix  in  our  outer  product:  exchange 
rows  and  columns  and  take  the  complex  conjugate.  (The  unit  matrix  in  the 
example  in  (S.32)  is,  of  course,  its  own  Hermitian  conjugate.) 


Then  to  verify  pseudo-Henniticity,  take  ria  and  a  r] ,  and  verify  that  they  are  the  same. 
This  constitutes  step  4. 


Let  us  do  this  for  example  (S.32). 

(1)  Hermitian  conjugation  of  the  operators: 

,  f  -  ^ 

C  +  B-i 


-l 


X 


{X  +  2  /i) 


VJTSk0  -iVkT 


X 


X  +  2  fd 


sj0 


+ 


(-iVkrSJ0 


-njts 


■k  O' 


(2)  Exchange  indices  to  get 
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+(-ivJTsk0-ivkTsJ0) 


which  from  equation  (S.3 1)  is  equal  to  C  +  B  . 

(3)  and  (4)  Taking  the  complex  conjugate  of  the  unit  vector  and  multiplying  it 
with  itself  are  trivial. 


Thus,  this  term  is  indeed  pseudo-Hennitian,  as  it  should  be  given  that  the  construction  of 
the  state  space  equation  guaranteed  that  it  would  be  so.  However,  for  the  first  time,  the 
pseudo-Hermiticity  was  not  obvious  by  casual  examination. 


Again  assign  it  the  label  a .  Demonstrating  that  this  term  is  pseudo-Hennitian  is  trickier 
than  for  our  previous  example. 

To  begin  with,  we  have  (using  definitions  (S.23),  (S.25)  and  (S.27)) 

C-B  =  (gjk  -ejk)  +  (p,k  -cJk) 


Then,  we  again  follow  the  3-step  procedure  to  take  the  Hermitian  conjugate  of  a  : 


(1)  Hermitian  conjugate  these  differential  operators  to  get 
C-B^> 


(2)  Exchange  indices: 
C-B -» 


(3)  Taking  the  Hennitian  conjugate  of  the  2x2  matrix 
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"0  P 

p  0, 


we  just  get  it  back. 


Thus, 


a' 


4<c-*> 


o  1 
V1  oy 


- a 


and 


W'  =  -\(C-B) 


1  0 
0  -1 


0  1 

V1  oy 


0  1 

-1  0 


=  \(C-B) 


0  -1 

1  0 


Now, 


aq=l-(C-B) 


0  1 
1  0 


(I  0 
0  -1 


o  -1 

vi  oy 


and  7] a'  =  (at]) 1  =  at] ,  and  we  have  explicitly  seen  that  now  the  second  “new”  tenn  in 
(S.30)  is  also  pseudo-Hermitian. 


Thus,  the  third  and  fourth  terms  in  (S.30)  are  properly  pseudo-Hermitian.  The  first  and 
second  terms  are  completely  standard  and  quite  obviously  pseudo-Hermitian  as  well. 
Thus,  we  have  indeed  explicitly  verified  that  the  matrix  Hamiltonian  7 i  is  pseudo- 
Hermitian,  and  consequently  that  our  formalism  properly  conserves  energy.  (From  our 
construction,  we  knew  things  should  turn  out  this  way,  but  it  is  nice  to  see  that  it  really 
worked  out.) 


S.l.1.7  Standard  form  of  the  state  space  equation 

Next,  let  us  put  the  matrix  Hamiltonian  TL  into  standard  form.  We  do  not  yet  know  what 
the  constant  k0  will  turn  out  to  be  (because  we  have  not  yet  chosen  a  in  ansatz  (S.l)), 

but  we  do  know  that  we  will  need  to  pull  out  a  term  of  the  fonn  k0Sjkt ]  (or  in  matrix 
notation  k{)  1 3x3  ®  t]  )  from  the  Hamiltonian  ‘H  in  equation  (S.30).  Thus,  we  have  (as 

above,  suppressing  the  explicit  outer  product  sign  ®  to  keep  things  from  getting  too 
cluttered) 
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(S.33) 


1  ,  JO  1 

0  =  —(A  —  D) 

~  2 '  7  l-l  0 


4(c~s) 


£  =  ^(A  +  D-2ka\) 


1  0 

vO  -1, 


1 


0  1 

vl_0y 

T 

f  i  rA 


+  2^  +  *) 


1  0 
0  1 


where  H  =  O  +  £  +  k0lix3rj  .  The  tildes  underneath  serve  to  remind  us  that  the  matrices 
are  6x6. 

Note  that  Or/  =  -rjO  (the  fact  that  \r] ,r}  =  0  is  crucial  here),  £ri  =  r]£ ,  if  =  1 .  Also,  we 
have  closure  under  multiplication'' vvv  (as  we  must  since  the  four  matrices  1  span 

the  set  of  2x2  matrices).  We  know  from  reference  [1]  (or  equivalently  Appendix 
C.2.3.2)  that  the  conditions  outlined  in  this  paragraph,  and  the  basic  form  in  the  previous 
paragraph  is  enough  to  ensure  that  the  Foldy-Wouthuysen  procedure  will  apply. 

Before  we  go  on  to  write  out  the  coefficients  in  equations  (S.33),  let  us  examine  one  more 
important  issue.  As  it  stands,  the  coefficients  in  equations  (S.33),  which  are  in  a  sense 
expansion  parameters,  contain  terms  that  are  not  yet  “small”.  Such  terms  appear  in  both 
O  and  £  ,  but  let  us  examine  their  appearance  in  O ,  where  it  will  most  directly  affect  the 
expansion  parameter. 

In  O ,  the  term  appears  in  the  coefficient  of  J/2  : 


vvw  j|lc  identities  <^T  =  T/  =  —fj  and  //<J  —  T  —  ~^Tj  serve  to  explicitly  remind  that  us  that  the 
expanded  set  of  four  matrices  that  now  appear  in  the  matrix  Hamiltonian  7f  are  indeed  closed. 
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f 


A-D- 


b+  d  +  / 

pco2  ad jk  = 


=  b  + 


pco2  a 


- 

( h  + 

q) 

r 

y^.  +  2/Z 

0  0 

1 

a 

0 

I  0 

ft 

0 

V 

0  1 
ft) 

l 

a  (A  +  2//) 
0 

0 


pco~a  — 


ap 


0 

0 


2  1 

pco~a - 

ap  j 


Since  b  is  already  small,  this  needs  to  be  small  too 


The  components  of  the  matrix  above  must  be  made  small.  The  first  step  is  to  add  zero  in 
the  form  -p0a)a  +  pQ0)a  to  all  three  terms,  l/(4>  +  2p0 )  - 1/(4,  +  2p0 )  to  the  top  term, 

and  1/ ( ap0 )  - 1 /(ap0 )  to  the  bottom  two  terms wwww.  This  will  give  us  a  matrix  whose 
terms  involve  only  the  deviations  from  the  reference  values  of  the  parameters  p ,  A +  2 p 
and  p  .  This  action  also  generates  an  additional  matrix  that  we  examine  next. 


Once  this  first  step  is  done,  we  are  still  left  with  a  “problem  term”  of  the  form: 


f 


p0co  a 


1 


a  (A)  2p0 ) 


o 


2  1 

0  P0G)  a - 

aPo 


0 

0 


7  1 

0  p0co  a - 

ap0 


This  term  has  an  interesting  property  that  is  a  little  different  from  anything  encountered 
previously  in  our  study  of  acoustic  and  electromagnetic  fields.  To  see  this  unique 
property,  let  us  examine  the  following  thought  experiment.  Consider  a  1 -dimensional 
(plane)  wave  in  a  completely  homogeneous  environment,  with  the  reference  parameters 
all  chosen  to  be  the  same  as  the  medium’s  parameters,  which  are  all  assumed  to  be 


wwww  Note  that  p0  does  not  in  principle  have  to  be  the  same  as  p0  .  In  fact,  p0,A0+  2ju0  and  p0  could 
refer  to  different  spots  in  the  medium,  or  some  set  of  reference  values  that  is  not  associated  with  any  point 
in  physical  space.  However,  usually  juo  will  be  chosen  to  be  the  same  as  po  . 
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globally  constant.  Even  in  this  most  basic  case,  there  is  nothing  to  make  this  term 
disappear  under  these  circumstances.  This  is  the  property  that  is  a  little  different  from 
any  encountered  previously  in  our  study  of  acoustic  and  electromagnetic  fields.  To  be 
specific,  recall  that  during  our  study  of  the  electromagnetic  and  acoustic  fields,  we  never 
encountered  such  a  property  at  all  in  the  odd  terms  (i.e.,  off  diagonal  in  the  2x2  state 
space),  and  when  we  observed  such  terms  in  the  even  terms  (i.e.,  the  terms  that  are 
diagonal  in  the  2x2  state  space),  they  eventually  disappeared  except  for  the  ever-present 
kjj  term.  Fortunately,  we  can  almost  make  these  tenns  disappear  with  a  judicious 
choice  of  the  free  parameter  a  . 


Choose 


cp  = 
c0  — 


I  A  E  2/c 


Po 


so  that  (with  kt]  =O)/cs0  ) 


p0O)2a 


P0A 

mA 


1  _  Wo 

a  (  A  )  Ao  +  ^Po 


(S.34) 


(S.35) 


and  similarly 


Now  our  “problem  term”  reduces  to 

(i 


0  0 

0  0 
0  0 


V 


J 


Note  that 


1- 


yo_ 

rp 

V  co  J 


1- 


Po 


A)  +  2fi0 


A)  +  2//0  //0 
A)  E  2p0 


(S.36) 


(S.37) 


or  with  ft0=  ju0, 


1- 


A+Po 
A + 2p0 
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Provided  A0  and  ju0  are  physical  values  evaluated  at  the  same  location,  then  this 
expansion  parameter  varies  from  1  down  to  %  (as  A  varies  from  infinity  down  to 
-2///3  ;  this  range  of  allowed  values  is  detennined  by  the  requirement  that  the  bulk 
modulus  A  +  Z/J  be  positive156).  Since  it  is  always  smaller  than  1,  this  is  a  legitimate 
expansion  parameter. 

So,  we  are  still  left  with  a  new  intrinsic  expansion  parameter  1  -  (cq/cq  )  .  While 

unprecedented,  after  a  little  thought,  this  will  not  turn  out  to  be  too  surprising.  Let  us  first 
proceed  to  write  out  the  state  space  equation  and  then  return  to  this  issue  in  Appendix 
S.  1 .2.2  (and  in  Section  7.2  of  the  main  text  as  well),  when  we  are  better  prepared  to 
examine  it  in  detail. 


For  now,  we  have  a  perfectly  good  state  space  equation  provided  that  a  =  ij^cop^l )  (as 
given  in  equation  (S.34)): 

f  \ 


O+S 

6x6  matrices 
made  up  of 
outer  products 
of  (3x3)®  (2x2) 
y  matrices 


K  ^3x3  ®  V 

or  equivalently 

K  sJkn 


<E> 

l^rJ 

6-component 

vector 


(S.38) 


with  (using  (S.33)) 


A=Ajk  =3x3  matrix  k 

0=  #+t(C-S)r 

£  =  i(J  +  D-2t0lM)^+i(C  +  S)lJ„2 

/t -2y  K-ip 


(S.39) 


So  now,  we  need  to  evaluate  A,  k,  y  and  /? . 


S.l.1.8  Evaluating  the  operators  A,tc,y  and  jB 

To  begin  our  calculation  of  the  operators  A,  K,  y  and  (5 ,  we  will  evaluate  the  following 
lemma. 

Lemma:  evaluate  “—h  —  q”  (which  shows  up  in  A  and  y) 
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We  make  the  usual  choice  a  =  1 /(&>/?,/ )  (but  we  will  not  at  this  point  restrict  ourselves 
to  jU0  =  ju0 ),  and  use  definitions  (S.25): 

-h-q  =  ~  —  Sjk  +-W° - — 1 - -8J08k0  .  (S.40) 

ajU  a/u  a{A  +  2p) 


Now,  using  (S.35)  and  (S.36)  respectively,  we  have: 


1 


+  2fi0  ,  5 

■  tXn. 


6f(/l  +  2y4/)  A  +  2ju 

^  _  Mo  L! 

K0 

aju  p 


rp 

\C0  J 


(S.41) 


and  so  combining  (S.40)  and  (S.41) 


~—ks8'k  i  — 0  !cs8j0Sk0  A  ^Pl 


-h-q  =  -^ks,dJk 

M  M 

=  -^-k*8Jk  +  ki8Jk  -k*8Jk 
M  ' 


0  jrs 


f  s  Y 

c, 


A  +  2p 


-o 
rp 

vco  y 


/to 


+ 


—knSi0Sk0  -k'8'"8 


/tO 


M 


zero/1 


/  zero 

f  V 


+k;si0sk0-ks0 


rp 

\c0  J 


SJ08 


/to 


/  zero 


zero/1 


-Vv 

c0 

rp 

vco  y 


</°</0  _ 


/  +  2//0  s 
A  +  2ju  0 


_<L 

rp 

vco  y 


/to 


and  so 


-h-q  =  -ks08jk  +ks0Sjk 


\  _Mo? 
K  M  J 


■ks08j08k0 


(  u  \ 

l  M  J 


+k;8J08k0 


+k;8J08k0 


(  rs  Y  f 
c0 
p 


vc0  y 


v 


I  _  /  +  2//0 

/i  +  2p 


c0 
rp 

v  v  0  ^  y 


Now, 


/=///  ;  /  -  -o 


Ck/cp  ;  <=//<, 


(S.42) 
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and  so 


Mp  _  Mol Po  '  Po  _ 
jU  jU/P  P 


r  „s\ 


\c  j 


—  =  n1  — 


P 


Ag  +  2ftQ  _  jo  +2  fig/  Pg  .  /Jg_  _ 

/L  +  2//  A  +  lpjp  p 


P 

f  „P\2 


nP 

VC  7 


^L  =  n2  — 

P  P  P 


and 


-h-q  =  -ks0SJk  +  £0? (<?'*  - SJ0Sk0 ) 


f  a\ 

l-n'A 

P  J 


+t0'5'V‘°(/ir)! 


i-»^ 


P 


(S-43) 


(Note  that  n2p  =  (cs0/cg )  =  Pgj{A  +  2//0)  is  less  than  1  for  the  usual  choice  pu  =  ju0 .) 


Equation  (S.43)  is  the  lemma. 


Now,  we  are  ready  to  evaluate  A,  y,  k  and  fd . 


From  equations  (S.23),  (S.25),  (S.27),  (S.39),  (S.43)  (the  lemma)  and  recalling  the  usual 
choice  a  =  1 j{copQcl ) ,  we  have 

(  \ 


AJk  =  - 

2 


b+  d  +  f  -h-q 

Combine  into  use  the  lemma 
pco2aSjk= 


/ Po 


,  KSJh 


2  COpgCg 


^T^T*t&+Xt)‘  +V‘tPV't  gl  +V‘r/'V 

+  ^  SJk-Si0Sk0 


+  P_k±  Sjk  _  ksSJk  +  K_  /  §jk  _  SjOSkO  \  ( x  _  n2  Po 
Po  2  2L__ - 1{  sPj 


-\\-P/  I A0  gjk 
\  /Pol  2 


!  ,vo 


SJ0Sk0(n*f 


1  ~nA) 

P ) 


sik 


Is 

_i_  Al 


<W°(l-(<)2) 


(S.44) 


We  also  have 
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r„=-  K  s* -h-g 

2  ^  v — v — ' 

Still  use  the  lemma 

V  unassigned  . 


=  \  k0sjk  -  k;sJk  +  k ;  ( sjk  -  sj0sk0  ){\-n]^ 

These  terms  do  not  automatically  get  ^  P  ) 

small.  Use  our  freedom  to  choose 

^  k0  =/c'q  to  get  the  needed  cancellation.  y 

+  ^  k;sj0sk0(n;pf^l-n2p^  +  £0hW°(l -(<)') 


and  so 


1  kl(8]k -5iQdk0)\\-n2s^y 


£0hW°(<)2  1  -n 


’8J08k0(l-(n*y 


(S.45) 


In  equations  (S.43)  and  (S.45),  recall  that  (using  definitions  (S.34)  (and  its 
generalizations)  and  (S.42)): 


,„2  pQ  _  Mo  .  ,„2  Po  _  \  +2//0  _  („sp\2 
„  ~  ’  Up  „  ~  1  ,  o  „  ’  \n°  ) 


P  P 


p  A  +  2  p 


P0 

A +  2  fio 


We  also  need  (using  (S.23),  (S.25)  and  (S.27)) 

*  =  kc~B) 


=  k^  +  P)-{c  +  e))  =  \({s~e)  +  {p-c)) 


Kik  =  - r -iVJTSk0  +  iVk8J0 1  +  -  -if  -  1  Vk8J0  +  iV 

r  T  j  2  {A  +  2p  r 


(A  +  2p) 


8k0  ,  (S.46) 


and  (using  (S.23)  and  the  definition  of  [3  noted  in  equation  (S.39)) 

P  =  -\B  =  -\(c+e) 

\ 

f3]k  =  -  m  -  -  8k0+i8J0Vk  . 

2  t\(A  +  2p)\ 


(S.47) 


Recall  that  equations  (S.44)-(S.47)  are  to  be  substituted  into  the  state  space  equation 
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(S.48) 


0Jk=AJkg+KJkT 

SJk  =  (AJk  -  2  f'k ) jj  +  (k*  - 2 pjk )  1 

HJk  =  Ojk  +  £jk  +  ks0SJkrj  , 


where 


co 


(7T 

(l  0  A 

i 

'  o  o 

(o  r 

ii 

Cr 

o| 

M 

II 

;  t=ti£  = 

c 

1 

v-1  °y 

l1  °y 

These  results  are  also  presented  in  equations  (7. 1 1)-(7. 13)  of  Section  7.2. 


S.1.2  A  few  comments 

Subsection  S.  1.2.1  discusses  a  change  a  variable  from  the  Lame  parameters  //  and  A  to 
their  reciprocals.  This  is  to  facilitate  8  -function  bifurcation  in  cases  where  an  interface 
is  present.  Subsection  S.  1 .2.2  discusses  the  expansion  parameter  associated  with 
equations  (S.37)  and  (S.44).  This  expansion  parameter  is  intrinsic  to  the  parabolic 
equation  for  an  elastic  solid,  since  there  is  no  choice  of  reference  parameters  that  will 
make  it  disappear. 


S.1.2. 1  The  correct  variables  to  use  if  8  -function  bifurcation  is  needed 


When  an  interface  is  present,  we  will  need  to  rework  this  result  as  follows.  In  order  to  be 
able  to  use  8  -function  bifurcation  without  imposing  Taylor  series  expansions  of  the  sort 
discussed  in  Appendix  K.2.3,  we  will  need  to  convert  the  tenns  sandwiched  between  Vr 
in  the  lead-order  derivative  (which  comes  from  powers  of  the  leading-order  terms  in  A  ) 
to  a  form  where  multiplication  by  a  “clean  step”  frees  up  the  lead-order  derivative  for 
further  integration.  To  do  so,  define  K2  =  1/ ju  and  Kx  =  \/A.  Thus, 

An  1  1 


A  +  2ju 


KxK2 


1  2 

- 1 - 

K,  K. 


K2  +  2KX 


2  J 


It  is  not  really  necessary,  but  we  could  also  define  K  =  K2+  2 Kl  =  (A  +  2/u)l(A/ll ) . 
Either  way,  the  leading-order  derivatives  that  appear  in  A  are  now 


Vr  =  Vr 
K 


1 


K2  +  2KX 


VT—VT 

tk2 


This  is  a  form  where  8  -function  bifurcation  can  be  applied  without  further  ado.  Also 
note  that  in  A,  fc,  y  and  /3  we  get  terms  such  as 
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A  _K2 
A  +  2ju  ~  K 


i2A 

P 


OC  K-, 


2  Po  A"  |  K  2 

tln  —  «  — 
p  p  K 


These  terms  never  appear  in  the  lead-order  derivative,  so  they  are  of  no  concern  as  far  as 
the  issue  of  S  -function  bifurcation  is  concerned. 


S.l.2.2  A  new  kind  of  expansion  parameter 


Now,  let  us  return  to  examine  the  “built-in”  expansion  parameter  contained  in  (S.44)  (it 
made  its  first  appearance  in  equation  (S.37)): 


This  term  turns  out  to  be  very  closely  related  to  the  term  p  =  {\  —  n2}  j2  =  {\  —  cy///}  j2  in  a 
constant  density  (i.e.,  8p  -  0  )  fluid  (as  well  as  to  similar  tenns  that  are  proportional  to 
Sp/ pQ,  8e/e 0  and  Sp/p0).  Both  the  p  (or  k()p )  term  and  the  |l  -(cj/c(()  j  Jl  (or 

k0  |l  -  (cV; /cl  )2  )/2 )  tenn  are  caused  by  a  discrepancy  between  the  “fundamental  wave 

number”  of  the  parabolic  equation  (i.e.,  k0 )  and  the  wavelength  of  a  given  physical  wave. 

What  varies  in  these  various  cases  is  the  source  of  the  discrepancy,  but  not  the 
discrepancy  itself. 

This  deserves  a  closer  look.  Recall  that  we  long  ago  established  that  the  parabolic 
equation  selects  out  a  preferred  axis  in  space:  the  downrange  axis.  To  incorporate  waves 
that  have  components  in  the  (non-preferred)  transverse  direction,  the  expansion  has  to 
“work  hard”  and  produce  a  power  series  in  the  expansion  parameter  V2  /k\  (or  more 

generally  iVT/k0  ).  It  is  just  a  little  more  obscure,  but  the  parabolic  equation  also  selects 
out  a  preferred  wave  number  k0 .  Components  of  the  overall  field  with  characteristic 
wave  numbers  that  differ  from  the  preferred  value  k0  are  once  again  properly  brought 
back  into  the  problem  via  an  expansion. 

In  the  constant-density  acoustic  problem,  we  have  a  true  physical  wave  number 
k  =  nk0  =  (c°/)k0 ,  and  in  this  case,  the  corresponding  expansion  parameter  is  indeed 
2 p  xxxx  (as  mentioned  above).  On  the  other  hand,  in  an  elastic  solid  we  have  a  field  with 
two  characteristic  wave  numbers:  k„  and  k,  .  No  matter  which  we  choose  as  our 
reference  wave  number  (or  equivalently  sound  speed),  we  will  have  a  non-favored  wave 

xxxx  The  factor  of  2  results  because  the  parabolic  equation  looks  like  an  expansion  of  I  +  2  . 
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number,  and  its  related  expansion  parameter  (l  —  %)  will  appear  in  the  parabolic 

equation.  For  example,  had  we  chosen  a  =  1 j[o)pQc% ) ,  then  k(r'  would  become  the 
reference  wave  number,  and  now  we  would  pick  up  the  expansion  parameter 

i-(<Ao)2yyyy. 

Thus,  we  see  that  we  have  encountered  a  new  attribute  that  must  appear  in  any  parabolic 
equation  (PE)  that  models  a  field  that  has  multiple  characteristic  wavelengths.  Such 
parabolic  equations  will  pick  up  intrinsic  expansion  parameters  that  are  connected  to  the 
need  for  the  PE  to  adjust  for  the  difference  between  its  reference  wave  number  and  one  or 
more  of  the  actual  physical  wave  numbers. 

Finally,  let  us  consider  a  numerical  example.  For  granite  and  limestone,  cs  ~  c  / 1.9 

typically,  and  the  expansion  parameter  1  ~(cs0/c^  =  1  - (1/1.9)'  =  0.723 .  Thus,  we  have 

an  intrinsic  expansion  parameter  that  is  fairly  close  to  1 .  Its  presence  explains  why  the 
parabolic  equation  for  elastic  solids  will  generally  require  fairly  high  orders. 


S.2  The  transformation  tying  the  parabolic  equation  field  x  to 
the  displacement  u 

This  appendix  derives  a  transformation  first  order  in  VT  that  connects  the  familiar  full- 
wave  displacement  vector  u  with  the  downrange  propagating  field  %  associated  with  the 
parabolic  equation.  The  transformation  as  recorded  in  Section  7.2  by  equation  (7.15) 
(with  definitions  (7.16)  and  (7.17))  is  derived  below. 

There  will  be  an  intermediate  result.  Consider  the  Hamiltonian  for  the  propagation  of  the 
displacement  vector  u  when  all  environmental  parameters  ( A,JLl,p  )  are  constant.  We 
discover  that  even  in  this  elementary  example,  a  fundamental  distinction  exists  between 
the  parabolic  equation  for  u  and  that  for  the  auxiliary  field  X  (which  unlike  u  will 
continue  to  propagate  according  to  a  parabolic  equation  even  when  range  dependence  is 


yyyy  Note  that  since  Cp  >  Cs  at  a  given  location,  we  run  a  very  real  risk  that  this  expansion  parameter  could 
become  greater  than  1 .  Generally,  the  expansion  parameter  will  be  bounded  by  1  if  we  choose  the  smallest 

s-wave  speed  as  the  reference  wave  speed  (and  so  a  =  I  [j ( Ojp()C(t  )  )•  Also  note  that  at  a  fluid-elastic 

interface,  the  smallest  s-wave  speed  will  go  to  zero,  and  our  expansion  parameters  go  to  1 .  This  is  much 
like  the  Dirichlet  or  Neumann  limit  in  acoustics,  and  it  indicates  that  it  will  be  quite  tricky  to  adapt  the 
parabolic  equation  derived  here  to  the  fluid-solid  interface. 
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added).  The  Hamiltonian  corresponding  to  u  is  the  desired  intermediate  result,  and  it 
will  be  derived  in  Section  S.2.1. 

This  intermediate  result  will  then  lead  to  our  final  result,  an  expression  for  \Ji2  ,  where 

X  °=  \/f5  ■  u  .  The  transformation  is  given  by  equation  (S.65)  (with  definitions  (S.64)  and 
(S.61)).  It  is  derived  in  Section  S.2.2. 

S.2.1  The  Hamiltonian  for  the  displacement  vector  u 

Below  in  Subsection  S.2.1,  it  is  shown  that,  neglecting  range  dependence  as  usual  and 
now  also  even  the  local  transverse  (e.g.,  vertical)  dependence  of  the  environmental 
parameters  A ,ju  (Lame  parameters)  and  p  (density),  we  can  take  the  square  root  of  the 
wave  equation  for  the  displacement  vector  u  ,  and  get  a  parabolic  equation  for  u  (see 
equations  (S.50),  (S.53)  and  (S.55)).  This  derivation  is  carried  out  in  Subsections  S.2.1. 3, 
S.2. 1 .5,  and  S.2.1 .6.  Subsection  S.2. 1 .3  perfonns  the  formal  square  root  operation, 
Subsection  S.2.1. 5  evaluates  a  dyadic  that  appears  in  the  result.  The  full  Hamiltonian 
contains  d/dx  operators  multiplied  by  the  expansion  parameter  Vr  and  so  the  d/dx 
operators  can  be  eliminated  by  iteration.  Subsection  S.2. 1 .6  performs  the  first  iteration. 

The  derivation  of  the  Hamiltonian  for  the  displacement  u  is  supplemented  three 
subsections.  Subsection  S.2.1.1  previews  an  important  finding  that  emerges  as  a 
byproduct  of  the  derivation:  The  displacement  vector  u  and  the  carrier  of  downrange 
flux  x  obey  different  conservation  rules  and  so  their  Hamiltonians  differ  in  an  important 
aspect-the  Hermiticity  property.  Subsection  S.2. 1 .4  analyzes  this  finding  the  context  in 
which  it  arises.  Subsection  S.2. 1 .2  introduces  the  aspects  of  the  elastic  wave  formalism 
that  will  be  used  in  the  derivation  of  the  Hamiltonian  for  u  . 


S.2.1.1  An  important  difference  between  the  displacement  u  and  the 
carrier  of  flux  x 


During  the  derivation  of  the  Hamiltonian  for  u  ,  it  will  be  noted  (in  Subsection  S.2.1 .4) 
that  we  cannot  use  this  Hamiltonian  as  the  Hamiltonian  for  the  Foldy-Wouthuysen  field 

X  ■  The  immediate  reason  is  that  the  transformation  operator  from  u  to  X,  yji2  (i.e., 
where  x  ^  V5  ■  u  )  does  not  commute  with  H ^  (the  standard  parabolic  equation  for  X ), 
even  given  all  our  stringent  simplifying  assumptions  about  the  local  environment  (i.e., 


H 


y,^ 


^  0  even  at  locations  where  none  of  the  environmental  parameters  are 


varying).  A  profound  property  associated  with  the  fact  that  the  transformation  operator 
y[i2  does  not  commute  with  the  Hamiltonian  (even  in  the  most  trivial  of  cases)  is  that  the 
magnitude  of  the  carrier  of  downrange  flux  x  '  X  must  be  conserved,  while  the 
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magnitude  of  the  displacement  vector  u  ■  u  is  not  (ultimately  because  of  beats  between 
the  pressure  and  the  shear  waves),  and  so  the  Hamiltonian  for  the  former  H %  must  be 

Hermitian,  while  the  Hamiltonian  for  the  latter  H-  need  not  be!  Thus,  the  transformation 
from  one  vector  to  the  other  cannot  commute  with  either  of  the  Hamiltonians  involved! 

(Or  more  precisely,  \ji2  cannot  commute  with  H- ,  and  4i2  cannot  commute  with 

A-) 


Recall  that  making  simplifying  assumptions  about  the  variability  of  the  environmental 
parameters  to  produce  a  parabolic  equation  that  is  valid  for  the  physical  wave  function, 
but  not  for  the  carrier  of  flux,  has  a  precedent  in  the  acoustic  case.  At  the  beginning  of 
the  discussion  of  the  stair  step  technique  in  Section  6.2.2,  we  found  that  for  the  range- 
independent  variable  density  ( Sp  ^  0  )  scenario, 


+  k2 


A  =  0: 


-7 


OX 


+  pVTAVT-k2,(l-n2)-A 


(As  always  in  this  study,  the  square  root  operator  is  understood  as  an  expansion,  this  time 
in  pVT  -(l  -  772  ) .)  This  equation  does  not  generalize  to  the  range-dependent  case. 

To  incorporate  range  dependence,  we  must  consider  the  carrier  of  flux  x  ■>  which 
propagates  according  to  the  somewhat  different  parabolic  equation  generated  by  the 

Foldy-Wouthuysen  transformation.  The  differences  between  the  propagation  equations 

■  ,2 

reflect  differences  in  the  two  fields’  conservation  properties.  \A\  is  conserved  relative  to 

the  metric  d"RT/ p ,  while  \x\  is  conserved  relative  to  the  metric  d~Rr  (see  Section 

6.2.2).  All  this  forms  a  prototype  for  the  properties  of  the  elastic  field  as  outlined 
immediately  above  in  the  first  paragraph  of  this  section. 


S.2.1.2  The  elastic  wave  equation 

Let  us  therefore  proceed  to  fully  develop  the  corresponding  scenario  for  the  elastic  wave. 
As  noted  above,  the  first  step  is  to  obtain  the  parabolic  equation  for  the  displacement 

vector  u  .  We  begin  with  equation  (S.3):  V  ■  f  =  -poru  ,  substitute  the  definition  of  f 
(equation  (S.2)),  and  locally  neglect  all  spatial  dependence  of  A  and  p  to  get 

2V  ( V  ■  if )  +  pV2u  +  //V  ( V  •  ii)  =  -poru 
(2  +  //)v(v  -u)  +  pS2u  +  poru  =  0 

and  expanding  the  gradients 
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(  d  -  V  d  -  }  (  d2  ) 

(A -h  p^  x - I- Vr  x - h  Vr  +1  p  — —  +  V2  -\- pco~  1  W  =  0 

V  dx  Jv  dx  J  ^dx'  J 

(A  +  //)  xx  — — t  +  (A  +  //)  Vrx  — — h  (A  +  //)  x  — —  Vr  +  (/l  +  p}  VrVj, 
ox'  ox  ox 


■u  =  0 


+1// — -  +  1//V2  +l/?ft»2 
ox 

r(/l  +  //)xx  +  l//l — 7  +  (/l  +  //)  — — x  +  x  — — +  (A  +  VrVj 

L  J  ox  V  ox  ox  J 

+1//V2  +  lpco2 


(A  +  2  p  0  0) 

[(/l  +  //)xx  +  l//]  =  0  p  0  =E 

l  0  0  //y 

is  a  3x3  diagonal  matrix  that  is  invertible  provided  p  +  0  .  Also  note  that  /l  +  2p  >  0 
(use  A  >  —  \p  from  reference  [156])  and  p  >  0 zzzz.  Note  that  for  this  diagonal  matrix, 

we  can  take  powers  and  reciprocals  simply  by  taking  powers  and  reciprocals  of  the 
components  along  the  diagonal. 

Thus  we  have  the  wave  equation  for  the  displacement  vector  u  : 

~  ~  d2  f  ~  d  3  _  A  -- 

E — -  +  (/!  +  //)  — — x  +  x— — Vy  +  (/l  +  p^ +  \pS7 T  +  \pco  w  =  0  . (S.49) 

dx'  V  dx  ox  J 


2222  It  is  interesting  to  consider  the  limit  p  =  0  .  We  have 
Axx%x  2  +  A(Vt  %x  X  +  X  %x  Vr  j  +  AVrVr  +  1  pQ)~  ■  W  =  0  and  taking  components: 

x  Aj^-ux  +  A-^{vt  ■  wr)  +  pco2ux  =  0 ;  AVT  +  AVT  (Vr  •  +  pco2uT  =  0  or 

X  ^ ( V  ■  u  )  +  ^j~Ux  =  0  and  Vr  ( V  ■  w)  +  UT  =  0  ,  and  so  (using  k~  =  )  we  have 

V  (V  ■  /7  j  +  k~U  =  0  .  Now,  noting  i7  =  VAj ( p(02  j  and  multiplying  through  by  the  constant  pOT  , 
we  have  V  ■  [yA/p^  +  [k2  /  ylj  =  0 ,  which  is  just  the  gradient  of  the  (time-independent)  acoustic 


wave  equation. 
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S.2.1.3  Formally  taking  the  “square  root”  of  the  elastic  wave  equation 

Next,  we  will  “get  rid”  of  E ,  the  operator  sitting  to  the  left  of  d2/dx2 ,  by  multiplying 
this  equation  from  the  left  by  E  . 

However,  before  doing  so,  let  us  address  an  important  issue.  In  a  superficially  similar 
situation  that  will  come  up  shortly,  we  will  take  pains  to  do  this  type  of  thing  in  as 
symmetric  a  way  as  possible.  In  the  present  case,  the  analogous  action  would  be  to 
multiply  (S.49)  by  E™  from  the  right  and  then  also  from  the  left.  Let  us  try  to  do  this, 
and  see  what  happens.  Multiply  the  equation  above  by  E from  the  right.  For  the 
diagonal  parts  of  (S.49),  it  is  trivial  to  see  that  multiplying  from  the  right  is  automatically 
the  same  as  multiplying  from  the  left.  In  this  case,  it  turns  out  that  a  similar  thing 
happens  when  we  multiply  the  non-diagonal  part  of  (S.49)  by  E~^2  from  the  right: 


Thus,  the  formalism  simply  kicks  E  ^  over  to  the  left.  Basically,  this  is  because  E  '1  is 
diagonal  and  because  the  only  free  index  against  which  to  perform  matrix  multiplication 
is  on  the  left.  Thus,  we  are  in  this  case  forced  to  eliminate  the  coefficient  of  d2/dx2  in 
(S.49)  E  by  multiplying  the  equation  by  E  1  from  the  left. 

Multiplying  (S.49)  from  the  left  by  FT1 ,  we  have 

f)2  -f-d  ^  ^  „  „ 

— -  +  (A  +  jU^ E  '■  Vr  — —  x  +  x— —  Vr  +(/l  +  jU^ E  ■  VyVj,  +  /lE^2tE  +  pco^E  1  -u  —  0, 

ox  ^  ox  ox  J 


or  formally  taking  the  square  root  of  this  equation  (with  the  sign  chosen  to  reflect 
downrange  propagation) 

f  _  3  3  _  \ 

pco  E  +(/l  -\- E  ■  — — x  +  x  — — Vr  „ 

—i — —  =  v  dx  ox  J  ■u=Hr-u . 

dx  1  ^  _  ** 

\  +(A  +  p)E  ■  V^V^,  +  ji/VjE 


(S.50) 


376 


Formally  speaking,  this  is  the  parabolic  equation  for  u  with  the  Hamiltonian  H- . 

5.2.1.4  The  Hamiltonian  for  u  is  neither  Hermitian  nor  is  it  the  same 
as  the  Hamiltonian  for  x 

Before  proceeding,  we  need  to  stop  and  consider  a  point  alluded  to  earlier.  By 
multiplying  the  non-diagonal  operator  (V7.  %xx  +  x %XVT )  from  the  left  by  E  1 ,  we  have 

created  a  non-symmetric  and  more  importantly  non-Hermitian  operator.  In  other  words, 
the  Hamiltonian  for  the  propagation  of  u  is  non-Hermitian — even  where  A ,p,p  are  all 

constants.  This  implies  that  the  quantity  J  (u*  ■  u^d2RT  is  not  conserved  during 

downrange  propagation.  To  get  an  intuitive  feel  for  this  result,  break  a  basic  plane  wave 

u  into  components  that  are  pressure  and  shear  waves:  u  =  up  +  us  =  upe'kpX  +  use'ks'x 

(where  the  time  dependence  e~,m  has  been  taken  out).  Now, 

_  I _  |2  | _  i2  _  _  /— *  _  i(kp—ks Vx \ 
u  •u  =  \un\  +  \u  +2K Q\un  -u,e 

\  P\  \  s\  \  Po  50  J 

or  taking  J  d2RT  ,  we  get  something  proportional  to 

Re  S(kpT-ksT)e[kpx~ksi]x  +..=S{kpT-ksT)cos((kpx-ksx)x)  +  ... 

where  the  ellipses  stand  for  terms  that  do  not  vary  with  the  range  (they  are  infinite,  but 
that  does  not  matter;  also  note  that  kpx  -  ksx  ^  0  even  though  the  transverse  components 

are  equal  because  the  sound  speeds  are  different).  Thus,  we  pick  up  beats  between  the 
pressure  and  shear  waves,  and  u*  ■  u  is  not  a  conserved  quantity — even  for  a  plane  wave 
where  A,ju,p  are  all  constants!  That  is  the  reason  why  the  Hamiltonian  H-  cannot  be 
Hermitian. 

This  allows  us  to  anticipate  a  very  interesting  result.  At  least  for  constant  A ,fi,p  ,  there 
exists  a  parabolic  equation  to  propagate  u  ,  but  it  cannot  be  the  same  parabolic  equation 
as  that  for  % ,  because  for  one-way  propagation,  X  X  is  conserved,  and  so  its 
corresponding  Hamiltonian  H x  must  be  Hennitian.  As  noted  above,  the  fact  that  X 
does  not  obey  the  parabolic  equation  for  u  also  implies  that  as  we  consider  the 
transformation  X  x  Vf?  ■  u  ,  we  will  have  H- ,  \fi2  ^  0 .  F inally ,  note  that  fl 2 ,  \li2  will 

be  Hennitian,  although  H-  is  not. 

5.2.1.5  Evaluating  the  dyadic  E  in  the  Hamiltonian  for  u 
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Now,  let  us  proceed  to  evaluate  the  Hamiltonian  H-  given  in  equation  (S.50).  We  have 
.  du 


-i  —  =  jpa)2E~1  ■ 

ox 


1+ 


A  +  p 
par 

A+2p-p  _ 
par 

V  -  V 
7kl  7k; 


Vr  -^-x  +  x^-VT 
OX  ox 


+ 


A  +  // 
par 

1/  - 1/ 

7kl  A? 


V  V  H — — — ,  (S.51) 
pa>- 


where  the  square  root  is  understood  as  a  Taylor  series  expansion. 
Now, 


|2  17-1 


pa>E 


f  par 
A  +  2p 


0 

0 


0 

pat2 

M 

0 


0 


par 

M  ) 


fk;  0  0 

0  k)  0 

o  o  k: 


and  noting  that  we  can  take  powers  of  a  diagonal  matrix  by  taking  powers  of  its  non-zero 
components,  we  have 

(k„  0  0^ 

yjparE^  = 


0  ks  0 
0  0  k 


:  k  =  k  xx  +  ks  (yy  +  zz) . 


(S.52) 


sj 


Substituting  (S.52)  into  (S.51),  we  have 


.dii  r 
-i  —  =  k  ■ .  |1  + 
ox 


1  _  1 

\KP  Ks 


a  „  „  a  -  ^ 


V  j,  — —  x  r  x  — —  V  j 
v  ox  ox  j 


+ 


Aj_ 

kK  ks 


-  -  V2 

VrVr  +^-u  .  (S.53) 


This  parabolic  equation  contains  an  explicit  d/dx  in  the  square  root  operator  (i.e.,  the 
Hamiltonian  H- ) — even  when  we  remove  all  spatial  dependence  on  the  environmental 
parameters  A,p  and  p  .  This  differs  from  what  we  found  in  the  acoustic  and 
electromagnetic  cases  ( d/dx  may  appear  in  the  transformation  between  the  physical  and 
Foldy-Wouthuysen  fields,  but  until  now  not  in  the  Hamiltonian).  Fortunately,  d/dx  is 
multiplied  by  the  “small”  expansion  parameter  V7. ,  and  so  we  can  iterate  this  equation 
and  still  generate  a  series  that  forms  a  parabolic  equation  for  u  .  This  series  will  be  an 
expansion  in  a  parameter  of  the  basic  form  VT/k  ,  which  is  a  typical  expansion  parameter 
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to  find  in  a  parabolic  equation.  The  key  thing  is  that  the  d/dx  is  not  in  the  0th  order  term, 
where  it  would  pose  insurmountable  problems. 


S.2.1.6  The  0th-order  Hamiltonian  for 


Now,  our  0th-order  (in  Vr  )  result  is  H.  =  k  =>  1  —  =  —  and 

ox  -i 

„  -  d  „  d  „  ic  , 
x  1  —  =  x—  =  x-  —  =  ikx. 

OX  OX  -7 


Thus,  one  iteration  of  (S.53)  gives  us 
//,  =  k  ■ 


1  + 


J_A 


[y Tikpx  +  ikpxVT )  +  2nd  order 


:  k  ■ 


i+l 

2 


Kkfksj 


kp  (/Vrx  +  x7Vr) 


+ 


o(vj) 


(S.54) 


Now,  note  that  k  ■  (/'V7.a'  +  x/V.,.  j  =  /f  V.(i'  +  ikpxVT ,  so  that 


Hr  =  k+i- 
2 


rl_  P 
yK~klj 


kp  [k  V Tx  +  kpxVT^  +  0(V^.) .  (S.55) 


Recall  that  this  is  the  Hamiltonian  for  the  displacement  vector  u  (where  the 
environmental  parameters  are  locally  held  constant.).  Now,  we  are  ready  to  use  this 

result  to  examine  yji2  where  recall  X  ^  •  u  . 


5.2.2  Using  the  Hamiltonian  for  u  to  obtain  the  endpoint  correction 
for  j 

Now  we  use  the  result  of  Appendix  S.2.1  to  obtain  the  transformation  between  the 
displacement  vector  u  and  the  parabolic  equation  field  X  ■  Subsection  S.2.2. 1  shows  that 

the  vector  fields  are  connected  by  an  equation  of  the  form  X  =  Vf?  ■  u  .  Subsection 

5.2.2. 2  evaluates  the  dyadic  iA  and  Subsection  S.2.2. 3  deduces  its  square  root.  Finally, 
Subsection  S.2.2.4  considers  the  “minimal  connection”  between  U  and  x ■ 
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S.2.2.1  The  basic  formalism  that  connects  the  carrier  of  flux  %  with 
the  displacement  u 


Recall  equation  (S.4):  Sx  =  |Im[w*  •  f  ■  xj .  Next,  we  want  to  write  this  in  terms  of  a 
“central  operator”  2(%x,Vr;A,fi,/?'J  so  that  (to  within  an  integration  by  parts) 


S  =  —  Im 
*  2 


u  ■  A- u 


(S.56) 


From  (S.5)  with  6  =  0  (i.e.,  one-way  downrange  propagation),  we  know  that  XX~  a^x 
(where  a  is  a  free  parameter  in  ansatz  (S.l);  for  reasons  discussed  in  Section  S.  1.1,  it  is 
usually  chosen  to  be  a  =  l/(ry/?0Co))-  The  expression  for  the  downrange  flux  given  in 
(S.56)  then  implies  that 


(S.57) 


To  see  this,  begin  with  \X  ■  X  =  {4i2  ■  wj  ■  \ji2  ■  u  ,  and  note  that  \ji2  invariably  turns 

out  to  be  Hermitian  (at  least  if  we  as  usual  neglect  range  dependence  locally  at  the 
endpoints).  This  means  that  we  can  take  the  product  and  break  it  into  halves  and  apply 
Hermiticity  (i.e.,  integrate  by  parts,  complex  conjugate,  transpose  matrices)  in  different 
directions  to  getaaaaa 


-X  X  =  UiA-u\ 


)  i  r  _  * 

■yJiA-u  =—  ( iA  ■  u  j  ■  u  + 


u  ■ iA-u 


1 

2  L 


u  A-u-u(A-u) 
=  \ra^u*  -A  -u~^  =  2Sx 


(S.58) 


which  indeed  agrees  with  equation  (S.56).  Note  that  the  operation  in  equation  (S.58) 
reflects  the  procedure  pursued  for  the  acoustic  field  in  Appendix  A.  It  is  also  worth 
noting  that  in  (R.29),  roughly  speaking  the  analogous  equation  for  the  electromagnetic 

field,  there  was  no  “/  ”  (i.e.,  in  Appendix  R.2,  -\f/l  was  the  relevant  Hermitian  operator 

rather  than  yji2  ),  because  there  we  needed  the  real  part  in  order  to  obtain  the  time- 
averaged  Poynting  vector.  (In  that  case,  it  also  happened  to  be  quite  easy  to  dispense 
with  the  need  to  explicitly  consider  only  the  real  part  at  this  stage.) 


aaaaa  Note  that  since  the  Hermiticity  property  for  differential  operators  is  used,  this  result  assumes  an 
implicit  integration  in  the  transverse  space. 
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S.2.2.2  Evaluating  the  dyadic  iA 


Now,  to  find  iA  and  so  yJiA  ,  we  start  with  if  ■  x ,  and  rewrite  it  as  iA  ■  u . 
a  modified  form  of  equation  (S.8): 

f  \ 


if -x  =  (A +  2  fi)xi^L+ Aix 
ox 


can 

replace 
with  u 


+  ///V Tu  x  +  [A  i 


duT 

dx 


(l-xxyfift 


(A  +  2ju)  xx  ■  H-  +  AixVT  +  juiVTx  +  ju(  1  -  xx)  •4 

iA 


and  so 


iA  =  [A  +  2ju)x(x- H^  +  i^AxV  T  +  /J57Tx)  +  //  ( 1  —  xx)- Ht 


Now  from  equation  (S.55),  we  have 

H-  =  k  +ia(ky  Tx  +  A:pxVr)  +  0(v2T) 


where 


Thus 


and  so 


xx  •  H-  =  kpxx  +  aikpxVT  +  O  ( V2r ) 

(l  -  xx)  ■  H-  =  ks  (l  -  xx)  +  aiksVTx  +  O  ( V^. ) 


iA  =  (A  +  2  ju)kpxx  +  (A  +  2 /u)  aik  pxS7  T  +  AixVr 
+fiks  (1  -  xx)  +  juaiky  Tx  +  /uN Tx  +  O  ( Vj. ) 

_ 0th  order _ 

=  (A  +  2ju)kpxx  +  juks  (1- xx)  +  \^[A  +  2 ju) ak p  +/lj/xV 
+  ^juccks  +  //] iVTx  +  O  ^  Vj  j 


Now,  the  0th  order  term  is 


We  begin  with 


(S.59) 
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f 


(A  +  2 ju)kpxx  +  pks  (1  -  xx ) 


pco 


U+2/i 

(2  +  2//)  kp 
0 
0 


0 

MK 

0 


0 

0 

M  K 


\pa_ 

V  J 


and  so 


[A  +  2p)k  xx  +  juks  (l-xx)  =  par 


A  +  2p 
par 

0 

0 


=  par 


o  V 


0  0 


M 


pa r 
0 


0  0 


V 


M 


par 


2  1 

=  par— 
k 


(S.60) 


Since  it  will  play  a  prominent  role  below,  let  us  pause  to  examine  the  square  root  of  the 
matrix  [l jk\ .  Note  that  [l/A'j  is  a  diagonal  matrix,  and  as  noted  above  (just  above 

equation  (S.49)),  for  such  a  diagonal  matrix,  we  can  take  powers  and  reciprocals  simply 
by  taking  powers  and  reciprocals  of  the  components  along  the  diagonal: 


Ak 


0 

(jr, 

0 


0 


(S.61) 


Going  back  to  equation  (S.60)  and  continuing  our  calculation,  we  have 

iA  =  par^-  +  'y(A  +  2p)akp  + A^ixV  T +[paks  +  p\iVTj c  +  o(V^.) 


,21 


=  pO) '  —  +  aixW  T  +  biW  Tx  +  O  ( V  \ ) 
k 
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S.2.2.3  Obtaining  \[il  from  iA 


Next,  we  will  pull  out  a  factor  of  po)2[l/k] ,  take  the  square  root  and  then  expand  in 

powers  of  Vr .  However,  this  time  (unlike  above  when  we  pulled  out  the  E  in  equation 

(S.49))  it  matters  how  we  pull  out  [l /k] .  We  will  pull  out  [l /k]  in  as  a  symmetric  a  way 

as  possible.  Indeed,  later  in  the  calculation  we  will  further  symmetrize  \[i2  in  order  to 
make  it  completely  symmetric  and  consequently  Hermitian. 


Thus,  let  us  pull  out  [l /k]  in  as  symmetric  a  way  as  possible: 

iA  =  pco2J-j~-  J-j-  +  aixV T  +  biVTx  +  O  (V^, ) 


and  expand 
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yji2  =  7  pco2 


\k  j 


1  +  I 
2 


1 

+  - 
2 


X 

— /2 

5/  jfv  . 

X 

2  V  T 

pco 

X 

~hi  W  " 

nn 

ykj 

- Tx- 

pco~ 

X* 

KkJ 


+ 


0(v*). 


Thus, 


\l  por 

X 

*  1 

+  — 

X 

-A 

■  ™xVT- 

X 

2 

ykj 

Jpo>2 

yk, 

1 

X 

^  ~hi  V  - 

x 

+  - 
2 

ykj 

■  . — =VTr- 

po>! 

+ 


O(V’) 


Now,  recall  that  [l fk\  is  a  diagonal  matrix,  and  so  it  is  easy  to  take  the  reciprocal.  The 
=  k'^  =  k'fxx  +  k'f'  ( yy  +  zz).  Thus,  we  have 


result  is  [l Jk] 


or 


=  Jp> 


or 


yk  j 


\k  j 


+  - -£L=  ky  ■  x  VT  ■  k  K  +  - ~^=  ky  ■  VT  +  O  (' V2  ) 


ipor 


2 

l  k*k, 


*T*  VrkX 


ipor 


VTkX  kp 4* 


+  — 


y[P' 


CO 


,2  L 


axVT  +  b VTx 


+ 


o(K) 


Now  we  symmetrize  yft  (formally,  this  would  involve  splitting  operators,  integrating  by 
parts  and  rearranging  terms): 

xW T  — >  ^-(xVr  +  Vrx)  ;  VTx  — >  ^-(xVr  +  Vrx) , 


and 


/ 

X 

.  1  . 

f  a  +  b^ 

rV  4-  V  r 

\  P'1) 

2  yj por 

v  2  v 

A  V  jt  1  V  j.A 

+ 


o(vp 


=  4p 


or 


(  1  Y:  V pco 1  (kpKf  / a  +  hA 

vkJ  2  2 


(S.62) 


v  P®  y 


xVj  +  V^x  +  O^V2j 


Next  we  evaluate  ( a  +  b^j ( /?&r ) : 
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K 


l/*s2  1 Ik] 


a  +  b_A  +  2/u^/  A  +  2/j  ju  jU 

~  —  2  ^k„  2  2~  ^  2 ~  aks  +  2 


j/£ 

A 


/?<«  /?<y 


/96T  /?6T  /9&T 


par 


/  P‘0 

1  ,  1  1  akK 

■■— Tak  +— r-— r+— ^  =  or 

K  p  K  k)  k) 


r 


i  i 

- 1 - 

k*J 


/  pyr 

A 


1  1 

H — ^ - 7 

kl  k: 


Now,  recall  equation  (S.59) 


a  =  — 
2 


\ _ 1_ 

~2  Ti 


xK  K)p 


,  1  1  2a 

k„  or  — — -  = — . 

K  ^  kp 


so 


a  +  b 
par 


a 


/  1  1  2A 
- 1 - 1 - 

xK  K  kPj 


a 


r  3  1  A 

- 1 - 

vkP  Kj 


and  substituting  the  definition  of  a  equation  (S.59) 


a  +  b  _  1 
par  2 


J_v 


k  ^ 

3  +  — 

Kj 


(S.63) 


This  quantity  is  positive:  with  k2p  =  par /(/L  +  2p) ,  kl  =  par / //  and  X>-2pj2  (see 
reference  [156]),  we  have  k2p<k2s,  \/k2p>\/kl  and  so  {a  +  b^  j{por)  >  0  . 


Substituting  (S.63)  into  (S.62),  we  have 

V/W 


\^i2=^[pco2 


\k  J 


+  - 


lor 


( kPksY 


1  _  1 

T  3T 


Kj 


V  r  X 

3  +  — 


V  ksJ 


xVT  +  VTx 


+ 


0(Vy).(S.64) 


From  (S.57),  with  the  usual  choice  a  =  1  j ( a>p{)c\ ) ,  we  have 


— — -y[i2  ■  u  . 


Wo 


(S.65) 


If  we  set  I)  =  \fi2  ,  equation  (S.65)  with  definitions  (S.64)  and  (S.61)  match  the  result 
given  in  Section  7.2  (i.e.,  the  transformation  given  by  equation  (7.15)  with  definitions 
(7.16)  and  (7.17)). 

S.2.2.4  The  minimal  connection 
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Equation  (S.65)  (with  (S.64)  and  (S.61))  should  probably  already  be  considered  to  be  the 
minimal  endpoint  correction.  To  be  specific,  even  in  the  minimal  correction,  we  should 

probably  include  1st  order  in  VT .  In  the  endpoint  transformations  for  the  acoustic  and 

electromagnetic  fields,  the  lowest  order  in  VT  was  2nd  order,  and  then  it  made  sense  to 

consider  the  0th  order  in  VT  to  be  the  minimal  correction. 
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